Transmembrane protein as a downstream target of neurotrophin and ephrin receptor tyrosine kinases, DNA encoding same and monoclonal antibodies thereto

ABSTRACT

The present invention relates to an ankyrin repeat-rich membrane spanning (ARMS) polypeptide which is a downstream target of neurotrophin and ephrin receptor tyrosine kinases. This ARMS polypeptide contains 1715 amino acid residues including four transmembrane domains, multiple ankyrin repeats, a sterile α motif domain, and a PDZ-binding motif. Immunofluorescence studies with anti-ARMS antibodies have found ARMS to be localized discreetly to the growth cone of neurons.

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] The present application claims priority under 35 U.S.C. §119(e) from U.S. provisional application No. 60/256,909, filed Dec. 21, 2000, the content of which is hereby incorporated entirely by reference.

GOVERNMENT LICENSE RIGHTS

[0002] The experiments performed in this application were supported in part by the National Institutes of Health, Grant Nos. NS-21072, HD-23315, and NS-10489. The U.S. Government has a paid-up license in this invention and the right in limited circumstances to require the patent owner to license others on reasonable terms as provided for by the terms of Grant Nos. NS-21072, HD-23315, and NS-10489 awarded by the National Institutes of Health.

BACKGROUND OF THE INVENTION

[0003] 1. Field of the Invention

[0004] The present invention relates to a transmembrane protein which is a downstream target of neurotrophin and ephrin receptors for phosphorylation.

[0005] 2. Description of the Related Art

[0006] The formation of the nervous system requires appropriate connectivity of neurons and their targets both spatially and temporally. Two families of proteins that mediate these actions are the neurotrophins and ephrins. Neurotrophins play a prominent role in the development of the vertebrate nervous system by influencing cell survival, differentiation and cell death events (Levi-Montalcini, 1987; Lewin and Barde, 1996). Neurotrophins also exhibit acute regulatory effects upon neurotransmitter release, synaptic strength and connectivity (Thoenen, 1995; Bonhoeffer, 1996; McAllister et al., 1999). In addition to promoting axonal (Patel et al., 2000) and dendritic branching, neurotrophins serve as chemoattractants for extending growth cones in vitro (Letourneau, 1978; Gundersen and Barrett, 1979; Gallo et al., 1997).

[0007] Ephrins comprise another class of ligands that function in axon guidance, cell migration, axon fasciculation, boundary formation, topographic mapping, and morphogenesis (Frisen et al., 1999). Ephrins are a family of eight proteins that are found associated with the plasma membrane, either through a GPI linkage (as seen in the A subfamily) or as transmembrane proteins (as seen in the B subfamily). Ephrins signal via receptor protein tyrosine kinases, but the biological outcomes are distinct from mitogenic factors such as platelet-derived growth factor (PDGF) and epidermal growth factor (EGF), both of which transmit signals through tyrosine phosphorylation (Bruckner and Klein, 1998; Flanagan and Vanderhaeghen, 1998; Holland et al., 1998). Ephrin receptor-associated molecules such as Crk, Nck, RasGAP, and Fyn are proposed links between the receptor and downstream events such as cell adhesion and cytoskeletal changes. In addition, Grb2, Grb10, and the p85 subunit of phosphatidylinositol-3-kinase (PI-3 kinase) are utilized in ephrin receptor signal transduction (Mellitzer et al., 2000).

[0008] The neurotrophins, NGF, BDNF, NT-3 and NT-4/5, exert their effects through two classes of receptors (Kaplan and Miller, 2000). TrkA, TrkB, and TrkC serve as receptors for NGF, BDNF and NT-4, and NT-3, respectively (Chao, 1992a). The p75 receptor is a member of the TNF receptor superfamily (Smith et al., 1994) and binds to all neurotrophins. Most central and peripheral neurons express p75 together with one or more of the Trks. The p75 receptor, when co-expressed with TrkA, provides a positive influence upon Trk function (Barker and Shooter, 1994; Verdi et al., 1994) and determines specificity of neurotrophin binding and responsiveness (Benedetti et al., 1994; Bibel et al., 1999; Brennan et al., 1999). An association of p75 and Trk receptors has been detected by co-precipitation (Huber and Chao, 1995; Gargano et al., 1997; Bibel et al., 1999) and co-patching of these receptors has been observed using fluorescent labeled antibodies (Ross et al., 1996).

[0009] Receptor tyrosine kinases frequently utilize a number of common intracellular signaling components such as phospholipase C-γ, PI-3 kinase and adaptor proteins such as Shc and Grb2. Common to many of these proteins is their ability to bind to phosphorylated tyrosines via domains such as the SH2 and phosphotyrosine-binding (PTB) domains. How these shared signaling components lead to different biological outcomes is not well understood (Chao, 1992a). Possible mechanisms include receptor utilization of substrates in a differential manner (e.g., differential association/dissociation kinetics), competition for binding between different substrates (Meakin et al., 1999), or recruitment of unique target proteins, such as rAPS and SH2-B (Qian et al., 1998).

[0010] Citation of any document herein is not intended as an admission that such document is pertinent prior art, or considered material to the patentability of any claim of the present application. Any statement as to content or a date of any document is based on the information available to applicant at the time of filing and does not constitute an admission as to the correctness of such a statement.

SUMMARY OF THE INVENTION

[0011] The present invention provides a polypeptide which is a transmembrane protein that is highly expressed in vertebrate central and peripheral nervous systems. This polypeptide, which associates with TrkA and p75 neurotrophin receptors, is a target for phosphorylation by neurotrophin and ephrin receptor tyrosine kinases, enhances neurotransmitter release, and modulates the clustering of proteins involved in ion channel formation, includes the amino acid sequence of: (A) SEQ ID NO:2; (B) SEQ ID NO:4; (C) a fragment of the polypeptide of SEQ ID NO:2; (D) a fragment of the polypeptide of SEQ ID NO:4; (E) a variant polypeptide which is at least 95% identical to SEQ ID NO:2; (F) a variant polypeptide which is at least 95% to SEQ ID NO:4; or (G) a functional derivative or a salt of (A), (B), (C), (D), (E), or (F).

[0012] The present invention also provides an isolated nucleic acid which encodes the polypeptide of the present invention or which hybridizes under stringent conditions to the complement of the nucleotide sequence encoding the polypeptide of the present invention a vector, a transformed host cell containing such a nucleic acid, and a method for producing the polypeptide of the present invention.

[0013] Further provided by the present invention is a molecule containing the antigen-binding portion of an antibody specific for the polypeptide of the present invention.

[0014] A still further aspect of the present invention provides a method for visualizing the growth cone of neurons, where the polypeptide of the present invention serves as a marker for growth cones.

BRIEF DESCRIPTION OF THE DRAWINGS

[0015]FIG. 1 shows the predicted topology of ARMS. Transmembrane domains and various intracellular motifs are depicted.

[0016]FIG. 2 shows the amino acid sequence and comparison of rat (SEQ ID NO:2) and human (SEQ ID NO:4) ARMS proteins. Dash-lined residues denote 11 contiguous ankyrin repeats; bold-faced tyrosine (Y) residues (at positions 399, 409, 441, 444, and 466 of the rat sequence) are evolutionarily conserved among human, rat, Drosophila, and C. elegans; boxed residues are the putative transmembrane domains; italicized residues denote the polyproline stretch; shadowed residues constitute the SAM domain (aa1152-1221); carboxy-most three asterisked amino acids (SIL) encode a PDZ-binding motif.

[0017]FIG. 3 shows the comparison of various cytoplasmic regions of rat (r), human (h), Drosophila (d) and C. elegans (w) ARMS. 1, amino terminal region between the ankyrin repeats and the first transmembrane domain with bold-faced, evolutionarily conserved tyrosines (Y); 2, cytoplasmic region between transmembrane domains 2 and 3; 3 and 4, two carboxy terminal regions; 5, the SAM domain. Sequences for wARMS and dARMS were obtained from accession numbers Z68760 and AAF46710, respectively. The sequence for hARMS was determined from overlapping ESTs with accession numbers BAA86564 and CAB63746. (symbols: “*” identity, “:” strongly similar, “.” weakly similar). The residue numbering of the various cytoplasmic regions corresponds to the residue numbering of the amino acid sequences of rARMS (SEQ ID NO:2), hARMS (SEQ ID NO:4), dARMS (SEQ ID NO:8), and wARMS (SEQ ID NO:6).

[0018]FIGS. 4A and 4B show a Northern analysis of ARMS (FIG. 4A) and a methylene blue staining of the 28S ribosomal band as a loading control (FIG. 4B). A single transcript of 7.0 kb was detected by Northern analysis using a ³²P-labeled ARMS cDNA probe (FIG. 4A). Each lane contained 20 μg of total RNA (with the exception of pancreas and DRG lanes which contained <10 μg each) extracted from various rat tissue.

[0019] FIGS. 5A-5D show the distribution of ARMS mRNA in the adult rat central nervous system by in situ hybridization. A ³³P-labeled cRNA probe was used to assess ARMS mRNA expression. Areas of intense labeling include the mitral cell layer of the olfactory bulb (OB; FIG. 5A), all regions of the hippocampus (HP; FIG. 5B), the Purkinje cell layer of the cerebellum (CB; FIG. 5C), grey matter—most notably in the ventral horn—of the spinal cord (SC; FIG. 5D).

[0020]FIGS. 6A and 6B show the expression in adult rat dorsal root ganglion (DRG) by in situ hybridization. A ³³P-labeled cRNA probe was used to assess mRNA distribution in DRG as depicted in the dark field image (left). The majority of cell bodies of the DRG were positive for ARMS mRNA expression, but notable absences of expression were localized to the large diameter DRG cell bodies as depicted by the arrows in the dark field and the corresponding phase (FIG. 6B) photographs. (scale bars=1 mm)

[0021] FIGS. 7A-7C show expression of ARMS mRNA by in situ hybridization of ARMS in embryonic day 14 (E14) rat. In a coronal section through the midsection of an E14 rat, only spinal cord (sc) and dorsal root ganglion (drg) were positive for ARMS (FIG. 7A). In situ hybridization of ARMS in a midsagittal section (FIG. 7B) and a more lateral section (FIG. 7C) of E14 rat. ARMS mRNA expression was restricted to various brain regions such as the cortex (cx), hippocampus (hp), pons, medulla (med), basal telencephalon (bt), principal and spinal trigeminal nucleus (tn), superior and inferior colliculus (clc) and spinal cord (sc). Multiple ganglia expressed ARMS mRNA, such as the dorsal root ganglion (drg), trigeminal ganglion (tg), geniculate ganglion (gg), vestibular ganglion (vg), and superior cervical ganglion (scg). (white scale bars=1 mm; black scale bar=50 μm)

[0022] FIGS. 8A-8B show the interaction of p75 with ARMS. HEK293T cells were co-transfected with cDNAs encoding full length ARMS, HA-tagged p75, ARMS plus p75, or empty vector. Cells lysates were immunoprecipitated with anti-ARMS 892 antiserum and immunoblotted with anti-HA (FIG. 8A). Expression of p75 receptors was confirmed by immunoblotting with anti-p75 (9992; FIG. 8B).

[0023] FIGS. 9A-9D show coprecipitation of TrkA and ARMS (FIG. 9A) and colocalization of TrkA and ARMS (FIGS. 9B-9D). In FIG. 9A, PC12 615 cells were treated for 10 minutes and 25 hours with NGF (100 ng/ml). Lysates were prepared and subjected to immunoprecipitation with anti-Trk C-14 antibody, followed by immunoblot with anti-ARMS antibody. Normal rabbit IgG was used as a negative control. In FIGS. 9B-9D, immunofluorescence analysis of ARMS and TrkA receptor in sympathetic neurons is shown. SCG sympathetic neurons were grown in the presence of 150 ng/ml NGF, fixed and immunostained as described in the Methods section of Example 1. The ARMS protein and the TrkA receptor were subjected to double immunostaining using an anti-ARMS antiserum (FIG. 9B) and an anti-Trk B-3 (FIG. 9C) monoclonal antibody and analyzed by confocal microscopy. The signal observed in FIG. 9D demonstrates overlap of the two signals (overlay) from immunostaining with anti-ARMS anti-serum and with an anti-TrkB monoclonal antibody. The arrow indicates cell surface co-localization of ARMS and TrkA.

[0024]FIGS. 10A and 10B show tyrosine phosphorylation of ARMS. In FIG. 10A, phosphorylation of ARMS by NGF in PC12 cells is rapid and can be blocked by K252a. The antiserum 892 was used to immunoprecipitate endogenously expressed ARMS from PC12 615 cell lysates. Anti-phosphotyrosine antibody, pY99 was used to assess tyrosine phosphorylation of the immunoprecipitated ARMS. Within 1 minute of NGF treatment, phosphorylation of ARMS could be detected, suggesting a direct phosphorylation by TrkA. Furthermore, 100 nM K252a potently blocked ARMS phosphorylation (top). In lysates of the same samples, TrkA autophosphorylation is shown using pY99 (bottom). The time course of ARMS phosphorylation by NGF in PC12 cells is shown in FIG. 10B. The phosphorylation peaks within 10 minutes (m) and is sustained for at least 25 hours (h) (top). Reprobing of the same blot with 892 demonstrated equivalent levels of immunoprecipitated ARMS from the various lysates (bottom).

[0025]FIG. 11 shows specificity of ARMS phosphorylation. Phosphorylation of ARMS is specifically induced upon NGF, but not EGF, treatment of PC12 615 cells. Two time points, 10 minutes and 2 hours, were examined for tyrosine phosphorylation of ARMS using the following conditions: no ligand (CTRL), 50 ng/ml EGF and 100 ng/ml NGF. To demonstrate the specificity of the ARMS antiserum, 892 (I), preimmune antiserum (P) was used in parallel immunoprecipitations (IP). The lower panel shows the amount of ARMS protein that was immunoprecipitated from the various lysates.

[0026]FIG. 12 shows the effects of other neurotrophins. The neurotrophins BDNF and NT-4/5 induce phosphorylation of ARMS through the TrkB receptor. PC12 cells stably expressing TrkB were treated with either 100 ng/ml BDNF or 100 ng/ml NT4/5 and the phosphorylation of ARMS was measured as described in FIG. 10A. BDNF, and to a lesser extent, NT4/5, were able to induce tyrosine phosphorylation of ARMS. The bottom panel depicts immunoprecipitated ARMS from each lysate.

[0027]FIG. 13 shows induction of ARMS phosphorylation in hippocampal neurons by BDNF. Primary cultures of E17 hippocampal neurons were prepared and treated with 50 ng/ml BDNF for the indicated times. Phosphorylation of ARMS was assessed by immunoprecipitation with anti-ARMS 892 antiserum and Western blotting with anti-phosphotyrosine pY99 antibody (top panel). Equal amounts of ARMS protein were immunoprecipitated from each lysate as shown with reprobing the same blot with 892.

[0028]FIGS. 14A and 14B show the effects of ephrins. In FIG. 14A, ephrin B2 induces ARMS tyrosine phosphorylation in NG108-15 cells expressing EphB2 receptor. Lysates were made from untreated or ligand-stimulated NG108-15 cells (using aggregated ephrin B2, 30-40 minutes) and immunoprecipitated with 892 antiserum. Tyrosine phosphorylation was assessed with pY99 in subsequent Western blots. Equivalent amounts of ARMS were immunoprecipitated as shown in the lower panel. In FIG. 14B, tyrosine phosphorylation of ARMS by ephrin B2 peaks at 30 minutes. Thus, the time course of ARMS tyrosine phosphorylation closely parallels that of receptor autophosphorylation.

[0029]FIGS. 15A and 15B show ARMS/Trk receptor interaction. Expression plasmids containing full length cDNAs for ARMS and TrkA, TrkB and TrkC receptors were transiently transfected into HEK293 cells (2×10⁶ cells/plate) following the calcium-phosphate method. ARMS, Trk and EGF receptor expression were detected by immunoblotting. Cells were lysed in 1% NP-40 lysis buffer containing 20 mM Tris-HCl pH 8, 150 mM NaCl, 1% NP-40, 2 mM EDTA and protease inhibitors (0.15 units/ml aprotinin, 20 uM leupeptin and 1 mM phenylmethylsulphonylfluoride), at 4° C., for 30 min. Immunoprecipitation was performed for 3 hours at 4° C. using 2-3 mg of total protein extract and the Flag agarose-conjugated antibody (Sigma, St. Louis, Mo.). After several washes, immunoprecipitates were analyzed by SDS-PAGE followed by Western blot with different antibodies e.g., antibodies against TrkA, TrkB, TrkC, Arms, or EGFR. Reactive protein bands were visualized by enhanced chemiluminescence detection (Amersham Corp., Piscataway, N.J.).

[0030]FIGS. 16A and 16B show a PC12 Immunofluorescence analysis. PC12 cells with (FIG. 16A) or without (FIG. 16B) NGF treatment cultured in Lab-Tek chamber slides (Nalge Nunc International) coated with collagen and poly-L-lysine were fixed with paraformaldehyde and permeablized with cold methanol. Cells were blocked with PBS containing 10% FCS and incubated with purified antibody (3 μg/ml) against the C-terminus of ARMS protein, 892 (Example 1). Primary antibodies were visualized using fluorescence-conjugated secondary antibodies (FITC-conjugated goat anti-rabbit IgG; Jackson Laboratories). Images were collected on a Leica confocal microscope (Nussloch, Germany) and show that ARMS is localized at neurite tips in PC12 cells after NGF treatment.

[0031] FIGS. 17A-17F show immunolocalization of ARMS and VAMP-2, a synaptic vesicle marker, in hippocampal neurons. Primary cultures of hippocampal neurons were obtained from rats E17-19 and maintained with Neurobasal medium (Gibco) supplemented with B-27 and 0.4 mM glutamine in Lab-Tek chamber slides (Nalge Nunc International) coated with poly-L-lysine. The cells were fixed with paraformaldehyde, permeablized with cold methanol and then blocked with PBS containing 10% FBS, 10% normal goat serum and 5% BSA for at least 30 minutes and incubated with anti-ARMS antibody 892 (FIG. 17A) and VAMP-2 (FIG. 17B) antibodies in blocking solution at room temperature or 4° C. Primary antibodies were detected using fluorescence-conjugated secondary antibodies (FITC-conjugated goat anti-rabbit IgG and rhodamine-conjugated goat anti-mouse IgG (Jackson Laboratories)). Images were collected on a Leica confocal microscope. FIG. 17C is a merged image of FIGS. 17A and 17B). FIGS. 17D-17F represent an enlargement of the staining in the top FIGS. 17A-17C, respectively, with the white arrows designating tips of processes

[0032]FIG. 18 shows the localization of ARMS in axons and growth cones of hippocampal neurons. Cultures of hippocampal neurons were assessed for ARMS expression by indirect immunofluorescence, as described above for FIG. 17A. A concentration of ARMS protein was found at the growth cone (white arrow) and along the axon in a punctate distribution.

[0033]FIG. 19 shows the interaction between ARMS and PDZ-containing proteins GRIP1, GRIP2 and PICK1. Expression plasmids containing myc-epitope tagged GRIP1, GRIP2 or PICK1 were cotransfected with a full length cDNA for ARMS in HEK293 cells. Cells were lysed in 1% NP-40 lysis buffer and immunoprecipitation was carried out with anti-myc antibodies, followed by Western blot for the ARMS protein.

[0034]FIG. 20 shows a proposed schematic model of interactions between Trk receptors, ARMS, PDZ-containing proteins and glutamate receptors.

DETAILED DESCRIPTION OF THE INVENTION

[0035] The present inventors have discovered a novel transmembrane protein, designated ARMS (for Ankyrin Repeat-Rich, Membrane Spanning), in rats and humans which is a downstream target/substrate of neurotrophin and ephrin receptor tyrosine kinases, Trk and Eph, respectively. This ARMS protein/polypeptide according to the present invention was found to have 1,715 amino acids (SEQ ID NO:2 for rat ARMS and SEQ ID NO:4 for human ARMS) containing four putative transmembrane domains, multiple ankyrin repeats, a SAM domain, and a potential PDZ-binding motif.

[0036] The ARMS protein of the present invention is a substrate for tyrosine phosphorylation by the Trk neurotrophin family of tyrosine kinase receptors. These receptors mediate the actions of neurotrophins (NGF, BDNF, NT-3 and NT-4), which are required for the survival and differentiation of neurons during development and after injury. Moreover, neurotrophin receptor signaling is required for a form of memory known as long term potentiation (LTP), formation of synapses, axonal targeting and hyperalgesia (pain). The ARMS protein is specifically a target for neurotrophins and ephrins, which are also responsible for axon guidance. Unlike other phosphotyrosine proteins, the ARMS protein is a specific target for neurotrophins and ephrins, and not for other polypeptide growth factors which use tyrosine kinase receptors.

[0037] Because of its specificity and highly neuronal expression pattern, ARMS is a useful indicator of the biological activity of neurotrophins and ephrins. The ARMS gene is expressed in postmitotic neurons during the stage of development in which extensive axon pathfinding is occurring and is also expressed during adulthood in highly “plastic” regions of the brain such as the hippocampus, cortex, Purkinje cells of the cerebellum, the olfactory bulb, and of the spinal cord motor neurons, regions which are enriched in Trk and Eph receptors.

[0038] The most significant commonality of these neuronal populations is their ability to undergo continued synaptic changes throughout adult life. Neurons of the olfactory bulb are continually renewed and hence must form new synapses; hippocampal neurons can undergo synaptic remodeling and LTP; Purkinje cell dendrites are highly plastic due to their constant structural remodeling; and motor neurons have the capacity to regenerate and to form new synapses with peripheral targets in adults. Peripheral neurons have regenerative properties throughout adulthood, and this process requires axon outgrowth and new synapse formation. Neurotrophins and ephrins are likely candidates in spinal cord regeneration (Frisen et al., 1992; Miranda et al., 1999), and the present inventors believe that ARMS is also utilized in this process.

[0039] The ARMS gene is also expressed in sensory, geniculate, vestibular and sympathetic neurons in the peripheral nervous system. These neural regions control hearing, vision, taste, smell, cardiovascular and motor functions. The ARMS protein is a very abundant neuronal-specific protein that contains consensus docking sites for ion channels and other signal transduction proteins. From immunofluorescence and confocal microscopic studies, there is a dramatic change in the distribution of ARMS in both hippocampal neurons and PC12 cells after treatment with neurotrophins BDNF or NGF, demonstrating that the ARMS protein is localized discretely at growth cones and in synaptic regions of neuronal membranes where neuronal activities are located.

[0040] The predicted integral membrane structure of ARMS led the present inventors to be believe that ARMS may function as an ion channel. The four predicted transmembrane domains of the ARMS protein and its overall structure is reminiscent of the TRP family of ion channels and the capsaicin receptor, VR1 (Caterina et al., 1997; Harteneck et al., 2000). These channels contain six transmembrane domains with intracellular amino and carboxy termini and amino terminal ankyrin repeats. Although ARMS channel activity was not detected in gene transfer experiments, it remains possible that this protein may also serve as a subunit of a channel, or more likely, is involved in clustering or maintenance of ion channels. Such a possibility has been suggested by the ability of the neurotrophin BDNF to elicit hippocampal, cortical, and cerebellar depolarization on a very rapid time scale (Kafitz et al., 1999), presumably through activation of sodium channels. While the mechanism of this activation is unknown, phosphorylation of ARMS by TrkB suggests a role for ARMS in neurotrophin-mediated regulation of neuronal activity, such as changes in electrophysiological activity of neurons.

[0041] Furthermore, the presence of several protein interaction domains strongly supports a role for ARMS in recruiting proteins to Trk receptor tyrosine kinases. Indeed, co-immunoprecipitation studies indicate that ARMS may be an integral part of a higher order Trk-p75 receptor complex. These interactions may not be limited to neurotrophin signaling, as the Eph receptor family is also capable of phosphorylating the ARMS protein.

[0042] Several neuronal proteins were found to bind to the ARMS protein. These proteins include synembryn, synaptotagmin, a-catenin and cdcRel, which are all involved in synaptic vesicle trafficking. ARMS appears to be involved in the neurotransmitter vesicle movements in the nerve cells and may be involved in directing the release of neurotransmitters from neurons. These neurotransmitters may be acetylcholine or dopamine. Another protein that was discovered to interact with the ARMS protein is the regulatory subunit of K⁺channels. This indicates that ARMS may be associated with ion channel activities, which give rise to changes in neuronal activity (action potentials). The significance of these findings stem from the potent effects of neurotrophins upon synaptic plasticity. The phosphorylation of ARMS by Trk and Eph receptors provides a mechanism for transmitting information from neurotrophins to specific regions of synaptic activity in nerve cells.

[0043] One aspect of the present invention is therefore directed to an ARMS polypeptide which associates with TrkA and p75 neurotrophin receptors and is a target for phosphorylation by neurotrophin and ephrin receptor tyrosine kinases. This polypeptide of the present invention contains the rat ARMS amino acid sequence of SEQ ID NO:2, a fragment of the polypeptide of SEQ ID NO:2 or a variant thereof which is at least 95% identical to SEQ ID NO:2, the human ARMS amino acid sequence of SEQ ID NO:4, a fragment of the polypeptide of SEQ ID NO:4 or a variant thereof which is at least 95% identical to SEQ ID NO:4, or a derivative or salt of any of the above.

[0044] Fragments of the full-length ARMS polypeptide of SEQ ID NO:2 or SEQ ID NO:4 are intended to cover any active fragment that retains the biological activity of the full-length ARMS polypeptide. For example, fragments can be readily generated from the full-length ARMS polypeptide where successive residues can be removed from either or both the N-terminus or C-terminus of ARMS, or from peptides obtained thereof by enzymatic or chemical cleavage of the polypeptide. Thus, multiple substitutions are not involved in screening for active fragments of ARMS. If the removal of one or two amino acids from one end or the other does not affect the biological activity after testing in assays described in the Examples and Figures herein, such truncated polypeptides are considered to be within the scope of the present invention. Further truncations can then be carried out until it is found where the removal of another residue destroys the biological activity.

[0045] Preferably, such active fragments of ARMS would contain the PDZ-binding motif in the cytoplasmic C-terminal portion of ARMS and further either contain one or more transmembrane domains of ARMS or be fused as a fusion polypeptide to a transmembrane domain, other than a transmembrane domain of ARMS, which allows a fragment of ARMS to retain the biological activity of the full-length ARMS polypeptide.

[0046] The term “variant” is intended to encompass a variant of the native sequence of SEQ ID NO:2 or SEQ ID NO:4, or of a biologically active fragment thereof, which has an amino acid sequence having at least 85% identity, preferably at least 90% identity, or more preferably at least 95% identity to the native sequence and retains the biological activity thereof. As would be appreciated by those of skill in the art, this term covers naturally-occurring variants as a preferred embodiment.

[0047] “Functional derivatives” as used herein covers chemical derivatives which may be prepared from the functional groups which occur as side chains on the residues or the N- or C-terminal groups, by means known in the art, and are included in the invention as long as they do not destroy the biological activity of the corresponding polypeptide/protein as described herein. Derivatives may have chemical moieties, such as carbohydrate or phosphate residues, provided such derivative have the same biological activity.

[0048] Suitable derivatives may include aliphatic esters of the carboxyl groups, amides of the carboxyl groups by reaction with ammonia or with primary or secondary amines, N-acyl derivatives or free amino groups of the amino acid residues formed with acyl moieties (e.g., alkanoyl or carbocyclic aroyl groups), O-acyl derivatives of free hydroxyl group (e.g., that of seryl or threonyl residues) formed with acyl moieties, or phosphorylated derivatives of free hychoxyl groups in serine and threonine residues. Such derivatives may also include for example, polyethylene glycol side-chains which may mask antigenic sites and extend the residence of the complex or the portions thereof in body fluids.

[0049] Non-limiting examples of such derivatives are described below.

[0050] Cysteinyl residues most commonly are reacted with alpha-haloacetates (and corresponding amines), such as chloroacetic acid or chloroacetamide, to give carboxymethyl or carboxyamidomethyl derivatives. Cysteinyl residues also are derivatized by reaction with bromotrifluoroacetone, alpha-bromo-beta-(5-imidazoyl)propionic acid, chloroacetyl phosphate, -alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl-2-pyridyl disulfide, p-chloromercuribenzoate, 2-chloromercuri-4-nitrophenol, or chloro-7-nitrobenzo-2-oxa-1,3-diazole.

[0051] Histidyl residues are derivatized by reaction with diethylprocarbonate at pH 5.5-7.0 because this agent is relatively specific for the histidyl side chain. Parabromophenacyl bromide also is useful; the reaction is preferably performed in 0.1 M sodium cacodylate at pH 6.0.

[0052] Lysinyl and amino terminal residues are reacted with succinic or other carboxylic acid anhydrides. Derivatization with these agents has the effect of reversing the charge of the lysinyl residues. Other suitable reagents for derivatizing alpha-amino-containing residues include imidoesters such as methyl picolinimidate; pyridoxal phosphate; pyridoxal; chloroborohydride; trinitrobenzenesulfonic acid; O-methylisourea; 2, 4-pentanedione; and transaminase-catalyzed reaction with glyoxylate.

[0053] Arginyl residues are modified by reaction with one or several conventional reagents, among them phenylglyoxal, 2,3-butanedione, 1,2-cyclodexanedione, and ninhydrin. Derivatization of arginine residues requires that the reaction be performed in alkaline conditions because of the high pKa of the guanidine functional group. Furthermore, these reagents may react with the groups of lysine as well as the arginine epsilon-amino group.

[0054] The specific modification of tyrosyl residues per se has been studied extensively, with particular interest in introducing spectral labels into tyrosyl residues by reaction with aromatic diazonium compounds or tetranitromethane. Most commonly, N-acetylimidazole and tetranitromethane are used to form O-acetyl tyrosyl species and 3-nitro derivatives, respectively.

[0055] Carboxyl side groups (aspartyl or glutamyl) are selectively modified by reaction with carbodiimides (R′—N—C—N—R′) such as 1-cyclohexyl-3-[2-morpholinyl-(4-ethyl)]carbodiimide or 1-ethyl-3-(4-azonia-4,4-dimethlypentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues are converted to asparaginyl and glutaminyl residues by reaction with ammonium ions.

[0056] Glutaminyl and asparaginyl residues are frequently deamidated to the corresponding glutamyl and aspartyl residues. Alternatively, these residues are deamidated under mildly acidic conditions. Either form of these residues falls within the scope of this invention.

[0057] The term “derivatives” is intended to include only those derivatives that do not change one amino acid to another of the twenty commonly-occurring natural amino acids.

[0058] The term “salts” herein refers to both salts of carboxyl groups and to acid addition salts of amino groups of the complex of the invention or analogs thereof. Salts of a carboxyl group may be formed by means known in the art and include inorganic salts, for example, sodium, calcium, ammonium, ferric or zinc salts, and the like, and salts with organic bases as those formed, for example, with amines, such as triethanolamine, arginine or lysine, piperidine, procaine and the like. Acid addition salts include, for example, salts with mineral acids, such as, for example, hydrochloric acid or sulfuric acid, and salts with organic acids, such as, for example, acetic acid or oxalic acid. Of course, any such salts must have substantially similar biological activity to the polypeptide of the invention.

[0059] Another aspect of the present invention is directed to a molecule which contains the antigen-binding portion of an antibody specific for the ARMS polypeptide of the present invention. It should be understood that the term “antibody” is intended to include intact antibodies, such as polyclonal antibodies or monoclonal antibodies (mAbs), as well as proteolytic fragments thereof such as the Fab or F(ab′)₂ fragments. Furthermore, the DNA encoding the variable region of the antibody can be inserted into DNA encoding other antibodies to produce chimeric antibodies (see, for example, U.S. Pat. No. 4,816,567).

[0060] Single chain antibodies can also be produced and used. Single chain antibodies can be single chain composite polypeptides having antigen binding capabilities and comprising a pair of amino acid sequences homologous or analogous to the variable regions of an immunoglobulin light and heavy chain (linked V_(H)-V_(L) or single chain F_(V)). Both V_(H) and V_(L) may copy natural monoclonal antibody sequences or one or both of the chains may comprise a CDR-FR construct of the type described in U.S. Pat. No. 5,091,513 (the entire contents of which are hereby incorporated herein by reference). The separate polypeptides analogous to the variable regions of the light and heavy chains are held together by a polypeptide linker. Methods of production of such single chain antibodies, particularly where the DNA encoding the polypeptide structures of the V_(H) and V_(L) chains are known, may be accomplished in accordance with the methods described, for example, in U.S. Pat. Nos. 4,946,778, 5,091,513 and 5,096,815, the entire contents of each of which are hereby incorporated herein by reference.

[0061] A “molecule which contains the antigen-binding portion of an antibody” is intended to include not only intact immunoglobulin molecules of any isotype and generated by any animal cell line or microorganism, or generated in vitro, such as by phage display technology for constructing recombinant antibodies, but also the antigen-binding reactive fraction thereof, including, but not limited to, the Fab fragment, the Fab′ fragment, the F(ab′)₂ fragment, the variable portion of the heavy and/or light chains thereof, and chimeric or single-chain antibodies incorporating such reactive fraction, as well as any other type of molecule or cell in which such antibody reactive fraction has been physically inserted, or molecules developed to deliver therapeutic moieties by means of a portion of the molecule containing such a reactive fraction. Such molecules may be provided by any known technique, including, but not limited to, enzymatic cleavage, peptide synthesis or recombinant techniques.

[0062] The polypeptide of the present invention can be used as a marker for neuronal cells which have the ability to undergo continued synaptic changes through adult life or for the presence and distribution (i.e, clustering and aggregation) of the ARMS polypeptide in such neuronal cells. More specifically, as demonstrated in Example 2, the polypeptide of the present invention is a marker for growth cones as ARMS was found to be localized discretely at growth cones and in the synaptic regions of neurons. It was also found that ARMS co-localized with Vamp-2, a synaptic vesicle marker, and can therefore be used for the same purposes as Vamp-2.

[0063] A molecule which contains the antigen-binding portion of an antibody specific for the ARMS polypeptide of the present invention can be labeled with a labeling compound and imaged in vitro or in vivo as is well-known in the diagnostic and imaging art. As a preferred embodiment, this molecule can be used in a method for visualizing the growth cone of neurons which method involves contacting a molecule containing the antigen-binding portion of an antibody specific for the ARMS polypeptide of the present invention with neurons to detect the presence of the polypeptide of SEQ ID NO:2 or SEQ ID NO:4 as a marker for the growth cone of neurons. The growth cone of neurons can then be visualized when the molecule containing the antigen-binding portion of an anti-ARMS antibody is detectably labeled, either directly or indirectly, i.e., through the binding of a secondary detectably labeled antibody to the anti-ARMS antibody.

[0064] Further aspects of the present invention is directed to a nucleic acid or polynucleotide which encodes the ARMS polypeptide of the present invention, a vector containing the nucleic acid, and a host cell transformed with such a nucleic acid. Preferably, the nucleic acid includes the nucleotide sequence encoding either the rat ARMS polypeptide of SEQ ID NO:2 or the human ARMS polypeptide of SEQ ID NO:4 or fragments thereof. More preferably, the nucleic acid includes the nucleotide sequence of SEQ ID NO:l or SEQ ID NO:3.

[0065] The nucleic acid according to the present invention is intended to encompass nucleic acids which specifically hybridizes under stringent conditions to the complement of either SEQ ID NO:1 or SEQ ID NO:3. Stringency conditions are a function of the temperature used in the hybridization experiment and washes, the molarity of the monovalent cations in the hybridization solution and in the wash solution(s) and the percentage of formamide in the hybridization solution. In general, sensitivity by hybridization with a probe is affected by the amount and specific activity of the probe, the amount of the target nucleic acid, the detectability of the label, the rate of hybridization, and the duration of the hybridization. The hybridization rate is maximized at a Ti (incubation temperature) of 20-25° C. below Tm for DNA:DNA hybrids and 10-15° C. below Tm for DNA:RNA hybrids. It is also maximized by an ionic strength of about 1.5M Na⁺. The rate is directly proportional to duplex length and inversely proportional to the degree of mismatching.

[0066] Specificity in hybridization, however, is a function of the difference in stability between the desired hybrid and “background” hybrids. Hybrid stability is a function of duplex length, base composition, ionic strength, mismatching, and destabilizing agents (if any).

[0067] The Tm of a perfect hybrid may be estimated for DNA:DNA hybrids using the equation of Meinkoth et al (1984), as

Tm=81.5° C.+16.6(log M)+0.41 (%GC)−0.61(% form)−500/L

[0068] and for DNA:RNA hybrids, as

Tm=79.8° C.+18.5 (log M)+0.58 (%GC)−11.8(%GC)²−0.56(% form)−820/L

[0069] where

[0070] M, molarity of monovalent cations, 0.01-0.4 M NaCl,

[0071] %GC, percentage of G and C nucleotides in DNA, 30%-75%,

[0072] % form, percentage formamide in hybridization solution, and

[0073] L, length hybrid in base pairs.

[0074] Tm is reduced by 0.5-1.5° C. (an average of 1° C. can be used for ease of calculation) for each 1% mismatching.

[0075] The Tm may also be determined experimentally. As increasing length of the hybrid (L) in the above equations increases the Tm and enhances stability, the full-length rat gene sequence can be used as the probe.

[0076] Filter hybridization is typically carried out at 68° C., and at high ionic strength (e.g., 5-6×SSC), which is non-stringent, and followed by one or more washes of increasing stringency, the last one being of the ultimately desired high stringency. The equations for Tm can be used to estimate the appropriate Ti for the final wash, or the Tm of the perfect duplex can be determined experimentally and Ti then adjusted accordingly.

[0077] Hybridization conditions should be chosen so as to permit allelic variations, but avoid hybridizing to other genes. In general, stringent conditions are considered to be a Ti of 5° C. below the Tm of a perfect duplex, and a 1% divergence corresponds to a 0.5-1.5° C. reduction in Tm. Typically, rat clones were 95-100% identical to database rat sequences, and the observed sequence divergence may be artifactual (sequencing error) or real (allelic variation). Hence, use of a Ti of 5-15° C. below, more preferably 5-10° C. below, the Tm of the double stranded form of the probe is recommended for probing a rat cDNA library with a rat DNA probe or a human cDNA library with a human DNA probe.

[0078] As used herein, highly stringent conditions are those which are tolerant of up to about 5% sequence divergence. Without limitation, examples of highly stringent (10° C. below the calculated Tm of the hybrid) conditions use a wash solution of 0.1×SSC (standard saline citrate) and 0.5% SDS at the appropriate Ti below the calculated Tm of the hybrid. The ultimate stringency of the conditions is primarily due to the washing conditions, particularly if the hybridization conditions used are those which allow less stable hybrids to form along with stable hybrids. The wash conditions at higher stringency then remove the less stable hybrids. A common hybridization condition that can be used with the highly stringent wash conditions described above is hybridization in a solution of 6×SSC (or 6×SSPE), 5×Denhardt's reagent, 0.5% SDS, 100 μg/ml denatured, fragmented salmon sperm DNA at an appropriate incubation temperature Ti.

[0079] A still further aspect of the present invention is directed to a method for producing a polypeptide of the present invention by culturing in a nutrient medium a host cell transformed with the nucleic acid according to the present invention, as discussed above, to produce and express the polypeptide and then recovering the produced polypeptide.

[0080] Having now generally described the invention, the same will be more readily understood through reference to the following examples which are provided by way of illustration and is not intended to be limiting of the present invention.

EXAMPLE 1

[0081] To define proteins associated with the neurotrophin receptors, the intracellular domain of the p75 neurotrophin receptor was used as bait in a two-hybrid screen of a dorsal root ganglion library. The properties of the novel transmembrane protein according to the present invention, designated ARMS, which contains a number of interesting features, including multiple ankyrin repeats, four putative transmembrane domains, a sterile alpha motif (SAM) domain, and a consensus PDZ-binding motif, is reported below. Interestingly, ARMS does not contain SH2 or PTB domains, suggesting it confers signaling specifity downstream of receptor tyrosine kinases in a manner distinct from classical adaptor proteins. An analysis of the structure and distribution of this protein suggests that it functions during development of the nervous system. Most important, ARMS is phosphorylated after treatment with NGF, BDNF and ephrin B2, indicating that it is a critical link between cell surface receptors and intracellular signaling events for both the neurotrophin and the ephrin families. The materials and methods used in the experiments in this example and the results obtained are provided below and discussed.

MATERIALS AND METHODS

[0082] Materials

[0083] NGF was obtained from Harlan (Indianapolis, Ind.); BDNF from PeproTech (Rocky Hill, N.J.); NT4/5 was a generous gift of Regeneron (Tarrytown, N.Y.); EGF was obtained from Intergen (Purchase, N.Y.); and K252a from Calbiochem (La Jolla, Calif.).

[0084] Construction of Two-Hybrid DRG Library and Yeast Two-Hybrid Screen

[0085] A cDNA library was constructed into unique Bst XI/Not I sites of a modified version of pJG4-5. Briefly, polyadenylated RNA, which had been purified from adult mouse and P1 rat DRG using Trizol (Gibco, Gaithersburg, Md.) and the PolyA Tract System (Promega, Madison, Wis.), was used as template for reverse transcription (Gibco, Gaithersburg, MD) with an oligo-dT/Not I primer. Subsequent ligation into pJG4-5 and electroporation into DH5α yielded a library of approximately 10⁶ cfu and average insert size of 1.5-2.0 kilobases. A two-hybrid interaction screen, based on the LexA system (Gyuris et al., 1993), was performed in EGY48. The bait consisted of the entire cytoplasmic region of rat p75 as an in-frame fusion with the LexA DNA binding domain. Library cDNAs were expressed as in-frame fusions with the Gal4 transcriptional activation domain. Approximately 90 million yeast transformants were screened for their ability to survive in the absence of leucine. Sequence analysis identified a novel cDNA clone of approximately 2.5 kb that corresponded to the carboxy terminal 250 amino acids of ARMS.

[0086] Isolation of ARMS cDNA

[0087] Several rat brain libraries (young adult whole brain, adult whole brain and adult hippocampus) were screened to obtain the full-length ARMS cDNA. Criteria used to establish the initiator methionine include the following: assessment of multiple, independent cDNA fragments spanning the start site, an upstream, in-frame stop codon, and conformity to the Kozak consensus sequence.

[0088] Northern Blotting and in situ Hybridization

[0089] Total RNA was extracted from various rat tissues using Trizol reagent (Gibco, Gaithersburg, Md.). Twenty micrograms of total RNA were loaded per lane (with the exception of pancreas and DRG RNAs, which were <10 μg per lane), electrophoresed through a 2.2M formaldehyde/1% agarose gel, transferred to nylon membrane (Qiagen, Santa Clarita, Calif.), baked for 2 hours at 80° C., and probed with a cDNA fragment of ARMS labelled with ³²P-dCTP using Ready-To-Go (Amersham Pharmacia Biotech, Piscataway, N.J.). In situ hybridization was performed as previously described (Lai and Lemke, 1991). Briefly, 30 μm paraformaldehyde-fixed brain sections from adult rat or whole embryos were slide-mounted, hybridized with a ³²P-labelled cRNA probe generated by T7 RNA polymerase from a PCR-generated ARMS fragment, and washed at a final stringency of 0.1×SSC at 60° C. for 35 minutes. Emulsion-dipped slides were exposed for various times (days to weeks) prior to developing and then counterstained with thionin.

[0090] Cell Culture

[0091] Human embryonic kidney (HEK) 293 and 293T cells were maintained in Dulbecco's Modified Eagle's Medium (DMEM) containing 10% fetal bovine serum (FBS) supplemented with 100 U/ml penicillin, 100 μg/ml streptomycin, and 2 mM glutamine. Native PC12 cells, TrkA-overexpressing PC12 cells (615) (Hempstead et al., 1992), and TrkB-overexpressing PC12 cells were maintained in DMEM containing 10% FBS and 5% heat-inactivated horse serum with 30 U/ml penicillin, 30 μg/ml streptomycin, 2 mM glutamine, and 200 μg/ml G418.

[0092] SCG neurons were prepared from P2 rats and cultured on collagen-coated coverslips in C-medium (minimum essential medium containing 10% fetal bovine serum (FBS) supplemented with 0.4% glucose and 2 mM L-glutamine) with 150 ng/ml 2.5S NGF (Harlan, Indianapolis, Ind.). To inhibit growth of non-neuronal cells, neurons were cultured in the presence of 24.6 μg/ml 5-fluoro-2-deoxyuridine and 24.4 μg/ml uridine.

[0093] Dissociated primary cultures of hippocampal neurons from embryonic day 17 (E17) rats were prepared from timed-pregnant Sprague-Dawley rats following published procedures (Aibel et al., 1998). Following dissection of the hippocampus, the meninges were removed. The tissue was briefly minced with fine forceps and then triturated through a fire polished pasteur pipet. Cells were counted and plated in Neurobasal media supplemented with B27 (Gibco, Gaithersburg, Md.) on cell culture dishes coated overnight with 0.01 mg/ml poly-D-lysine. Cells were grown in a humidified incubator with 5% CO₂ at 37 C.

[0094] Antibodies

[0095] To characterize the expression of the ARMS protein, polyclonal antibody against the carboxy terminus of ARMS was generated. A bacterially expressed GST fusion protein with the carboxy-terminal 180 amino acids of ARMS was purified and used as antigen to generate rabbit antiserum (Cocalico Biologicals, Reamstown, Pa.). The specificity of this antiserum was determined by Western blot analyses of HEK293T cells transfected with a FLAG-tagged, full-length cDNA of ARMS. Lysates of transfected HEK293T cells, but not untransfected cells, displayed a specific 190 kilodalton species after immunoblotting with either an anti-FLAG antibody or anti-ARMS serum, 892.

[0096] Transfection of Mammalian Cells, Immunoprecipitation and Immunoblotting

[0097] For transient transfection experiments, HEK293 or HEK293T cells plated at 70-80% confluency in 10 cm dishes were subjected to calcium phosphate transfection with different combinations of the mammalian expression plasmids containing cDNAs for ARMS, rat TrkA, and rat HA-tagged p75 (Khursigara et al., 1999). PC12 615 cells or transiently transfected cells were harvested and lysed in 1 ml TNE buffer (10 mM Tris (pH 8.0), 150 mM NaCl, 1 mM EDTA, and 1% NP40) containing 0.12 mg/ml phenylmethylsulfonyl fluoride (PMSF), 2 μg/ml leupeptin, 1 μg/ml aprotinin, 10 mM sodium fluoride (NaF), and 1 mM sodium orthovanadate (Na₃VO₄) for 30 min on ice. Following centrifugation and removal of the insoluble fraction, protein concentration of the supernatant was determined by the Bio-Rad Protein Assay reagent (Bio-Rad Laboratories, Hercules, Calif.) with bovine serum albumin as the standard. Cell lysates of equivalent protein content were incubated for 4 hours to overnight with rotation at 4° C. with either anti-pan-Trk polyclonal antibody, C14 (1.5 μg) or anti-ARMS (892) antiserum (1:100). The immune complexes were immobilized on protein A-Sepharose beads (Sigma), washed six times with ice-cold TNE buffer, boiled in SDS-sample buffer, separated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), and transferred onto PVDF membrane (Millipore, Bedford, Mass.). Immunoblotting was carried out by first blocking membranes in TBST buffer (20 mM Tris (pH 7.5), 500 mM NaCl, 0.1% Tween-20) containing 5% BSA for pY99 and 5% nonfat milk for others, and then incubating for 2 hours at room temperature or overnight at 4° C. in TBST buffer containing one of the following primary antibodies: anti-Trk 44 serum (1:2000); anti-phosphotyrosine pY99 antibody (0.1 μg/ml) (Santa Cruz Biotechnology, Santa Cruz, Calif.); anti-ARMS (892) antiserum (1:1000). Membranes were washed with TBST buffer and then incubated with horseradish peroxidase (HRP)-conjugated goat anti-rabbit (Roche Molecular Biochemicals, Indianapolis, Ind.) or goat anti-mouse secondary antibodies (Jackson ImmunoResearch Laboratories, West Grove, Pa.) at 1:10,000 and 1:7500 dilutions, respectively. Immunoreactive protein bands were detected by enhanced chemiluminescence using ECL reagents (Amersham Pharmacia Biotech, Piscataway, N.J.).

[0098] Immunofluorescence Analysis

[0099] SCG neurons were fixed with 4% paraformaldehyde and blocked with PBS containing 0.075% saponin and 10% FBS or normal goat serum (NGS). Cells were then incubated with anti-ARMS (892, 1:1000) or anti-Trk B-3 antibody (1 μg/ml) in blocking buffer. Primary antibodies were visualized using fluorescence-conjugated secondary antibodies (Cy3-conjugated goat anti-mouse IgG (Jackson Lab; 1:200) or FITC-conjugated goat anti-rabbit IgG (Jackson Lab; 1:100)). Images were collected on a BioRad confocal microscope.

RESULTS

[0100] Cloning of Ankyrin Repeat-Rich, Membrane Spanning (ARMS)

[0101] Neurotrophins exert many biological activities, but few signaling molecules have been found to be specific for neurotrophin receptor signaling. The majority of proteins that serve as substrates for neurotrophin receptors are utilized by other receptor families. Effects on cellular and axonal migration, nerve regeneration, and apoptosis have been implicated for the p75 neurotrophin receptor, and effects on long term potentiation and synaptic transmission have been attributed to the Trks. To define unique molecules in neurotrophin signaling, a two-hybrid screen was undertaken using a rat dorsal root ganglion library as prey and the cytoplasmic domain of p75 as bait. This report describes a p75-interacting protein that is a downstream target for Trk signaling.

[0102] A positive cDNA clone was identified that encoded the carboxy terminal portion of a novel 1,715 amino acid protein. The predicted amino acid sequence contained eleven contiguous, 33 amino acid ankyrin repeats in the N-terminal domain and four putative membrane-spanning regions. This protein was designated as ARMS for Ankyrin Repeat-rich, Membrane Spanning. The overall topology of ARMS is decidedly different from other transmembrane proteins. Other distinguishing features of this protein include a SAM domain, a polyproline stretch, and a potential PDZ-binding motif (FIGS. 1 and 2). These motifs represent candidate protein-protein interaction domains.

[0103] An extensive search of GenBank databases for proteins similar to ARMS identified many ankyrin-containing proteins; however, proteins sharing homology with ARMS in regions outside of the ankyrin repeats were notably absent. Analysis of C. elegans and Drosophila databases revealed ARMS orthologs in these organisms. Further analysis of human databases allowed the present inventors to determine the sequence of human ARMS from two overlapping ESTs in the database. The presence and conservation of ARMS sequences from nematodes to humans suggests that this protein may serve evolutionarily conserved functions. Between rat and human, the amino acid identity is 91% and similarity is 94% (FIG. 2). Comparison of ARMS sequences among human, rat, Drosophila and C. elegans revealed similarity in overall structure: multiple ankyrin repeats in the amino terminus, four putative transmembrane domains, SAM domains, and consensus PDZ-binding motifs at the carboxymost three amino acids (“SIL” in human and rat; “TKL” in Drosophila, and “SDA” in C. elegans). Despite the relatively divergent carboxy termini among the ARMS homologs, the presence of potential PDZ recognition sequences in all of these proteins indicates that conserved protein-protein interactions for ARMS may exist.

[0104] Interestingly, the regions most highly conserved were not the transmembrane (TM) domains, but certain regions surrounding them, namely the ankyrin repeats, the amino terminal portion between the ankyrin repeats and the first TM domain, the “loop” region between TM domains 2 and 3, and several stretches of amino acids in the carboxy terminus downstream of TM 4, including the SAM domain (FIG. 3). The regions highlighted in FIG. 3 range between 20% to 42% identical and 56% to 77% similar, whereas the TM domains are only 1% identical and 43% similar. It is interesting to note that embedded in these subdomains are conserved tyrosine residues; these potential phosphorylation sites are conserved among all four organisms examined (FIG. 3, part 1). Although mutations in C. elegans and Drosophila ARMS have not yet been described, the C. elegans homolog, F36H1.2, is most similar to the ankyrin-related gene, UNC-44 (28%), which has a role in axon guidance, axonogenesis, and neuronal development (Otsuka et al., 1995).

[0105] Distribution of ARMS Transcripts in Rat

[0106] In rat tissue, expression of ARMS mRNA was assessed by Northern blot analysis using a probe directed against the membrane-spanning regions of ARMS (FIG. 4A). A single transcript of approximately 7.0 kilobases was detected. Although ARMS mRNA could be detected in several non-neuronal tissues, it was most abundant in the nervous system.

[0107] The expression of ARMS by in situ hybridization in adult rat brain was examined. Several populations of neurons were found to express ARMS mRNA, including mitral cells and cells of the glomerular layer of the olfactory bulb, all regions of the hippocampus, Purkinje cells of the cerebellum, and grey matter (most notably in presumed large motor neurons) of the spinal cord (FIGS. 5A-5D). One shared property of these neuronal populations is their ability to undergo synaptic changes throughout adulthood.

[0108] During defined periods of rodent embryonic development, subpopulations of sensory neurons require distinct neurotrophins and their cognate receptors for survival (Pinon et al., 1996). The neurotrophin receptor system remains functional through adulthood. Correspondingly, ARMS expression coincides with that of the Trks and p75. As shown by Northern analysis (FIG. 4A) and in situ hybridization (FIGS. 6A-6B), ARMS expression persists in the adult DRG. In addition to an absence of silver grains over the nerve fibers, there was a notable absence of ARMS message in a subset of DRG cell bodies corresponding to large diameter neurons. Although ARMS mRNA can be detected in all populations of DRG neurons, much lower expression in large diameter neurons suggests a less prominent role in proprioception.

[0109] More striking was the restricted expression of ARMS during development. In general, during rat nervous system development, the period of embryonic growth between E11 to E14 is a time of massive proliferation of the neuroepithelium. By E14, the first sets of postmitotic neurons are undergoing differentiation. In E14 rat embryos, ARMS was expressed in both spinal cord and dorsal root ganglia (FIG. 7A). These neuronal populations are among the first in the nervous system to undergo differentiation, and it is during this window of development that these postmitotic neurons are actively seeking and making connection with their targets. Additionally, several neuroanatomical loci of the developing brain expressed ARMS mRNA. In the midsagittal plane, the hippocampus, cortex, pons, and medulla were positive for ARMS expression (FIG. 7B). Regions lateral to the midline such as the basal telencephalon, superior/inferior colliculus, principal trigeminal sensory nucleus, and multiple ganglia, including trigeminal, geniculate, vestibular, and superior cervical (FIG. 7C) showed significant levels of ARMS transcripts.

[0110] Association of ARMS with NGF Receptors

[0111] Since the ARMS protein was originally identified in a yeast two-hybrid assay from its association with the cytoplasmic domain of the p75 receptor, the laboratory of the present inventors tested whether ARMS was capable of an interaction with p75 in cultured cells. HA-tagged p75 cDNA and the ARMS cDNA were co-transfected in HEK293T cells. After immunoprecipitation with an antibody (892) made against the C-terminal region of the ARMS protein, the p75 receptor could be readily detected by Western analysis (FIGS. 8A and 8B). This transfection experiment indicated that ARMS and p75 are capable of forming a complex in HEK293T cells.

[0112] Previous studies indicated that p75 and the TrkA receptor can exist in a complex (Huber and Chao, 1995; Gargano et al., 1997; Bibel et al., 1999). To establish whether an interaction between TrkA and ARMS occurs, an immunoprecipitation experiment in PC12 cells expressing elevated levels of TrkA was carried out (Hempstead et al., 1992). An NGF-dependent association between ARMS and the TrkA receptor was detected following immunoprecipitation of TrkA and immunoblotting with the anti-ARMS antibody, 892 (FIG. 9A). The association between ARMS and TrkA persisted 25 hours after NGF treatment. The endogenous association of TrkA and ARMS indicates that ARMS may exist in a ligand-dependent complex with Trk receptors.

[0113] To investigate further the distribution and potential co-localization of ARMS and Trk in neuronal cells, indirect immunofluorescence experiments in primary cultures of rat sympathetic neurons. Using the 892 antibody against ARMS (FIG. 9B) and a monoclonal antibody directed against Trk (FIG. 9C), co-localization of ARMS and TrkA on the cell surface of sympathetic neurons was observed (FIG. 9D, arrow). These results support the co-immunoprecipitation studies that suggest a physical association between Trk and ARMS.

[0114] Tyrosine Phosphorylation of ARMS by NGF

[0115] The high degree of correspondence in the expression of ARMS and neurotrophin receptors and the endogenous association with TrkA receptors led the present inventors to postulate that ARMS might function as a target for Trk receptor phosphorylation. To investigate this possibility, cell lysates from NGF-treated PC12 cells were prepared and the ARMS protein was immunoprecipitated with anti-ARMS antiserum, 892. Phosphorylation of ARMS protein was visualized by immunoblotting with an anti-phosphotyrosine antibody, pY99. Tyrosine phosphorylation of ARMS was detectable within a minute of NGF addition (FIG. 10A, top), closely following the time course of TrkA autophosphorylation (FIG. 10A, bottom). With continued NGF treatment, the phosphorylation of ARMS persisted for 25 hours (FIG. 10B). Pretreatment with the alkaloid-like compound K-252a (100 nM), which specifically inhibits NGF-mediated activity by selectively blocking the kinase activity of TrkA (Koizumi et al., 1988; Berg et al., 1992), completely abolished the tyrosine phosphorylation of ARMS. Hence, the ARMS protein represents a novel downstream target for the TrkA receptor tyrosine kinase.

[0116] To explore the specificity of ARMS phosphorylation, the effects of EGF and NGF were compared in PC12 cells. The phosphorylation of ARMS was considerably more robust following NGF treatment (FIG. 11), even though both TrkA and EGF receptors were activated. These data indicate that ARMS phosphorylation by TrkA was a specific consequence of NGF, but not EGF, receptor signaling. This may be a consequence of a lack of association of ARMS with the EGF receptor. It is unlikely that ARMS phosphorylation was mediated through the MAP kinase pathway, since treatment of PC12 cells with EGF showed minimal tyrosine phosphorylation of ARMS after 10 minutes or 2 hours (FIG. 11). Also, the pronounced ARMS phosphorylation was not elicited by other growth factors, such as insulin, FGF and insulin growth factor-1, in PC12 cells (data not shown), indicating unique proteins are phosphorylated by the receptor tyrosine kinase, TrkA.

[0117] Response to Other Trophic Factors and Ephrins

[0118] To test the ability of other Trk receptors to phosphorylate ARMS, PC12 cells stably expressing the TrkB receptor were used. Upon treatment of these cells with BDNF (100 ng/ml), ARMS was tyrosine phosphorylated, demonstrating that ARMS can be phosphorylated by TrkB as well as TrkA (FIG. 12). That expression of ARMS is localized to some central regions in which TrkA is not expressed, such as the hippocampus, can be reconciled by the presence of TrkB, which is widely expressed in the adult CNS and which serves as a receptor for BDNF and NT-4/5 (Farinas et al., 1998).

[0119] To verify the ability of BDNF to induce phosphorylation of ARMS in neurons, primary cultures of hippocampal neurons were established. Hippocampal neurons express the TrkB receptor. After treatment with BDNF, ARMS was immunoprecipitated from hippocampal lysates and assessed for tyrosine phosphorylation by immunoblotting with an anti-phosphotyrosine antibody. A similar phosphorylation of ARMS after BDNF treatment in hippocampal neurons was observed (FIG. 13). Thus, in a primary neuronal culture, ARMS is a downstream target of TrkB after activation by BDNF.

[0120] The expression of ARMS during developmental periods of axon outgrowth and synaptogenesis raises the possibility that ARMS is downstream of other known axon guidance regulators. The ephrin family was therefore examined because of its well-established in vitro and in vivo guidance function. As seen in FIG. 14A, the ARMS protein is potently tyrosine phosphorylated by ephrin B2 in a neuronal/glioma hybridoma cell line, NG108-15, that stably expresses EphB2 receptor. EphB2 receptor autophosphorylation and activation occurs over a time period of 30-40 minutes (Holland et al., 1997), in contrast to a much more rapid activation for Trks. Significantly, the phosphorylation of ARMS followed a time course similar to the time course of Eph receptor autophosphorylation (FIG. 14B).

[0121] Unlike the Trk receptors, which are activated upon dimerization by their respective ligands, Eph receptors are aggregated into multimeric complexes to be biologically active. This differential property of receptor aggregation may account for the differential time courses observed between neurotrophins and ephrins. Interestingly, the time course of TrkA and EphB2 phosphorylation of ARMS suggests that ARMS may be phosphorylated directly by the kinase domains of the receptors themselves and not via downstream intermediates. The integral membrane nature of ARMS and its potentially close proximity to receptor tyrosine kinases could facilitate such a process. Most important, these results suggest that ARMS is phosphorylated by receptor tyrosine kinases with established roles in axonal targeting and guidance. It remains to be determined whether ephrin-Eph clustering and subsequent ARMS phosphorylation is involved in attractive or repulsive effects during axon guidance.

DISCUSSION

[0122] ARMS: A Novel Downstream Target for Receptor Tyrosine Kinases

[0123] Although the signaling properties of the Trk receptor tyrosine kinases have been studied extensively, there remain many neurotrophin-stimulated activities in which molecular mechanisms have not been fully defined. These include internalization and transport of receptors, growth cone guidance, and axonal and dendritic branching. A number of common substrates, including phospholipase C-γ, PI-3 kinase, and Shc and Grb2 adaptor proteins, are utilized by many receptor tyrosine kinases, raising the question of how phosphorylation events lead to different biological outcomes (Chao, 1992b). One possibility is that there remain unique substrates that determine the specific nature of neurotrophin responses.

[0124] Here, the properties of a novel tyrosine phosphorylated transmembrane protein are described. The ARMS protein is highly expressed in many neuronal populations and functions as a downstream target for both Trks and Eph receptor. A common feature of ARMS, p75, TrkA, and Eph receptors is that they all contain C-terminal PDZ-binding motifs. It is therefore possible that these proteins are localized to the same subcellular compartment by a PDZ-containing molecule, and this localization may contribute to signaling events both developmentally and in adulthood. Indeed, the EphB2 receptor has been found to be associated with PDZ-containing proteins, and this association is thought to be important for receptor function at neuronal synapses (Torres et al., 1998) and for vestibular axon guidance and ionic homeostasis in the inner ear (Cowan et al., 2000).

[0125] At E14, trigeminal and vestibular axons migrate toward their peripheral targets and commence synapse formation. High levels of ARMS mRNA expression in these subpopulations suggest that ARMS may participate in axonogenesis or axon guidance. A notable absence of ARMS mRNA expression was observed in germinal zones of the developing brain, regions that are extensively proliferating. A general conclusion that can be drawn from the in situ hybridization studies described here is that ARMS-positive neuron populations, for the most part, are post-mitotic and post-migratory. For example, the somatic and visceral motor neurons of the spinal cord are born by E12 and E13, but by E14 are already undergoing axonogenesis (Paxinos, 1995). It is during this post-migratory, differentiative stage that ARMS is highly expressed. Conversely, the presumptive olfactory bulb, which is devoid of neurons at this stage of development, is negative for ARMS expression; the absence of ARMS expression in pre-migratory neuronal populations supports the observation that ARMS expression may be restricted to post-mitotic mitotic and post-migratory neurons. These findings suggest that ARMS may be involved in post-migratory events, such as axon guidance or synaptogenesis.

[0126] Expression of ARMS in the adult central nervous system was observed in regions such as the olfactory bulb, hippocampus, Purkinje cells of the cerebellum, and spinal cord motor neurons. The most significant commonality of these neuronal populations is their ability to undergo continued synaptic changes throughout adult life. Neurons of the olfactory bulb are continually renewed and hence must form new synapses; hippocampal neurons can undergo synaptic remodeling and LTP; Purkinje cell dendrites are highly plastic due to their constant structural remodeling; and motor neurons have the capacity to regenerate and to form new synapses with peripheral targets in adults. Peripheral neurons have regenerative properties throughout adulthood, and this process requires axon outgrowth and new synapse formation. Neurotrophins and ephrins are likely candidates in spinal cord regeneration (Frisen et al., 1992; Miranda et al., 1999), and the present inventors believe that ARMS is also utilized in this process.

[0127] Colocalization of ARMS with Neurotrophin and Ephrin Receptors

[0128] The in vivo utilization of ARMS by the neurotrophin and ephrin receptors is supported by co-expression of these proteins during development. For example, Trk and p75 expression can be detected in all peripheral ganglia of neural crest origin (e.g., superior cervical and dorsal root), spinal cord, and brainstem, regions also high in ARMS mRNA (Yan and Johnson, 1988; Snider, 1994). In the adult, ARMS and p75 expression overlap most notably in the olfactory bulb, Purkinje cells of the cerebellum and motor neurons of the spinal cord. Additionally, Trk receptors are found throughout the nervous system in regions that also express ARMS. For example, TrkA and ARMS are both expressed in the small diameter sensory neurons of the DRG (Averill et al., 1995; Wright and Snider, 1995). Adult hippocampus, cerebellar Purkinje cells, and motor neurons of the spinal cord are a few regions in which the expression of ARMS and TrkB significantly overlaps (Yan et al., 1997).

[0129] In a similar manner, Eph receptor and ARMS expression overlap in adult hippocampus and spinal motor neurons. During development, the Eph receptors are also found in ganglia of neural crest origin (e.g., sensory and vestibular) and in the tectum (Flanagan and Vanderhaeghen, 1998), regions that also express ARMS. Ephrins and their receptors have a well known function in establishment of the retinotectal pathway, and the expression of ARMS in the tectum suggests that ARMS plays a role in this process.

[0130] Given the multiple cell populations that express Trks, Eph receptors and ARMS, such as sensory and motor neurons, our findings raise the possibility of cross-talk between these two receptor systems. ARMS may serve to link or modulate these two signaling pathways in a cooperative or competitive manner. This could potentially be achieved by receptor phosphorylation of similar or different tyrosine residues of ARMS and/or association with other membrane proteins. It is plausible that ARMS may serve as an adaptor protein, given its PDZ-binding motif and other protein interaction domains that would allow for recruitment of multiple, diverse proteins.

EXAMPLE 2 MATERIALS AND METHODS

[0131] Yeast Two-Hybrid Screening

[0132] The two hybrid interaction screenings were performed using 3 different baits named 2.1, 7 and 10 that corresponded to amino acids S1303-L1715, D1056-S1151 and L1603-L1715 of ARMS protein (SEQ ID NO:2), respectively. The bait plasmids were generated by PCR with rat ARMS cDNA and the amplified fragments were ligated into pEG202 at BamHI-NcoI sites as an in-frame fusion with the LexA-DNA binding domain.

[0133] A cDNA library from postnatal day (P1) dorsal root ganglia (DRG) was generated (see Example 1). The DRG library cDNAs were expressed as in-frame fusions with the Gal4 transcriptional activation domain. Approximately 5×10⁷ yeast transformants were screened for growth in absence of leucine. Positive-interacting clones were confirmed with β-galactosidase activity and their specificity was tested using unrelated proteins as baits.

RESULTS

[0134] As demonstrated in Example 1, FIGS. 15A and 15B show that ARMS interacts with TrkA, TrkB, and TrkC receptors but not with EGF receptors.

[0135] The laboratory of the present inventors have also found that following phosphorylation, ARMS becomes localized discretely at growth cones and in synaptic regions of neuronal membranes as visualized using anti-ARMS antibodies in immunofluorescence studies. FIG. 18 is an immunofluorescence analysis that shows ARMS localized in growth cones and axons of hippocampal neurons. It was further demonstrated that ARMS co-localizes with Vamp-2, a synaptic vesicle marker, in hippocampal neurons (FIGS. 17A-17F).

[0136] The results of immunofluorescence and confocal microscopic studies show that there is a dramatic change in distribution of ARMS in both hippocampal neurons and PC12 cells after treatment with BDNF or NGF (FIGS. 16A and 16B), indicating that ARMS becomes concentrated at synaptic sites where neuronal activities are located.

[0137] The laboratory of the present inventors further identified several neuronal proteins (Table 1) that interact/bind with the ARMS protein using a yeast two-hybrid screening system. TABLE 1 Proteins Interacting with ARMS PROTEIN LOCALIZATION AND FUNCTION REFERENCES Cell division control Predominantly expressed in the brain and Beites et al., 1999 related Protein 1 associates with membranes incluing synaptic (CDCrel-1) vesicles. Inhibition of exocytosis Calcium channel beta-3 Predominantly expressed in the brain but also in Castellano et al., 1993 subunit aorta, lung, heart, pancreas and the adrenal gland. Regulation of Ca²⁺ entry into the cell Actin related protein ⅔ Part of a complex implicated in the control of Welch et al., 1997 complex, subunit 3 (21 actin polymerization in cells. kDa) (ARPC3) Synembryn Regulation of neurotransmitter secretion by Miller et al., 2000 controlling the production and consumption of diacylglycerol. GRIP1 and GRIP2 (ABP) enriched in brain synapses Dong et al., 1997 Possible role in either an anchoring of AMPA Srivastava et al., 1998 receptors at synapses or in the trafficking of AMPA receptors

[0138] The proteins in Table 1, identified as interacting with ARMS, include several synaptic vesicle proteins, a calcium channel subunit and proteins involved in clustering glutamate channels (NMDA, kainate). These interactions suggest that the ARMS protein is involved in neurotransmitter vesicle movements in nerve cells, in the release of neurotransmitters, i.e., acethylcholine or dopamine, and in the clustering of ion channels, all of which cause changes in neuronal activity (action potentials).

[0139] There is growing evidence that neurotrophins, such as NGF and BDNF, affect neurotransmitter release and neuronal activity, in addition to promoting neuronal survival (Auld et al., 2001; Albeck et al., 1999; Pozzo-Miller et al., 1999; Jia et al., 1999; Sala et al., 1998; Kang et al., 1995; Korte et al., 1995; Toledo-Aral et al., 1995). The significance of these findings stem from the potent effects of neurotrophins upon synaptic plasticity. The phosphorylation of ARMS by Trk and Eph receptors provides a mechanism for transmitting information from neurotrophins to specific regions of synaptic activity in nerve cells.

[0140] PDZ proteins, such as GRIP1 and GRIP2 in Table, contain a protein interaction domain (PDZ) that bind in a sequence specific way to short C-terminal sequences in a wide variety of membrane proteins. The function of PDZ proteins is to assemble complexes of proteins that perform signaling functions in specific subcellular locations, such as at synapses or growth cones. PDZ-containing proteins have been shown to cluster AMPA receptors, such as GluR2 and GluR3 (Dong et al 1997; Srivastava et al 1998). The Trk receptors were demonstrated to associate with ARMS (Example 1 and FIGS. 15A and 15B). Furthermore, ARMS can recruit PDZ proteins such as GRIP1 and GRIP2 (FIG. 19), which possesses seven PDZ motifs. This interaction between ARMS and GRIP proteins is most likely between the C-terminus of ARMS (SIL) and the first PDZ repeat of GRIP. The AMPA glutamate receptors bind to the 7^(th) PDZ repeat of GRIP through its C-terminal sequence VKI and can also bind to the 3^(rd)-6^(th) DZ repeats of GRIP.

[0141] These binding interactions have been extensively defined for many proteins (Sheng et al., 2001). The interactions between ARMS and GRIP proteins link ARMS to ion channels which can regulate neuronal activity. Since ARMS is a specific and robust substrate for Trk receptor phosphorylation, neurotrophins may therefore communicate with components of synapses through the ARMS and PDZ-containing proteins. This provides a potential mechanism to explain changes in synaptic plasticity that have been detected with neurotrophins (Thoenen, 1995 Poo, 2001).

[0142] Having now fully described this invention, it will be appreciated by those skilled in the art that the same can be performed within a wide range of equivalent parameters, concentrations, and conditions without departing from the spirit and scope of the invention and without undue experimentation.

[0143] While this invention has been described in connection with specific embodiments thereof, it will be understood that it is capable of further modifications. This application is intended to cover any variations, uses, or adaptations of the inventions following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains and as may be applied to the essential features hereinbefore set forth as follows in the scope of the appended claims.

[0144] All references cited herein, including journal articles or abstracts, published or corresponding U.S. or foreign patent applications, issued U.S. or foreign patents, or any other references, are entirely incorporated by reference herein, including all data, tables, figures, and text presented in the cited references. Additionally, the entire contents of the references cited within the references cited herein are also entirely incorporated by references.

[0145] Reference to known method steps, conventional methods steps, known methods or conventional methods is not in any way an admission that any aspect, description or embodiment of the present invention is disclosed, taught or suggested in the relevant art.

[0146] The foregoing description of the specific embodiments will so fully reveal the general nature of the invention that others can, by applying knowledge within the skill of the art (including the contents of the references cited herein), readily modify and/or adapt for various applications such specific embodiments, without undue experimentation, without departing from the general concept of the present invention. Therefore, such adaptations and modifications are intended to be within the meaning and range of equivalents of the disclosed embodiments, based on the teaching and guidance presented herein. It is to be understood that the phraseology or terminology herein is for the purpose of description and not of limitation, such that the terminology or phraseology of the present specification is to be interpreted by the skilled artisan in light of the teachings and guidance presented herein, in combination with the knowledge of one of ordinary skill in the art.

REFERENCES

[0147] Aibel L, Martin-Zanca D, Perez P, Chao M V (1998) Functional expression of TrkA receptors in hippocampal neurons. J Neurosci Res 54:424-431.

[0148] Albeck D S, Backman C, Veng L, Friden, P et al. Acute application of NGF increases the firing rate of aged rat basal forebrain neurons. Eur J Neurosci 11, 2291 (1999)

[0149] Auld D S, Mennicken F and Quirion R. Nerve growth factor rapidly induces prolonged acetylcholine release from cultured basal forebrain neurons. J Neuroscience 21, 3375 (2001)

[0150] Averill S, McMahon S B, Clary D O, Reichardt L F, Priestley J V (1995) Immunocytochemical localization of trkA receptors in chemically identified subgroups of adult rat sensory neurons. Eur J Neurosci 7:1484-1494.

[0151] Barker P A, Shooter E M (1994) Disruption of NGF binding to the low affinity neurotrophin receptor p75LNTR reduces NGF binding to TrkA on PC12 cells. Brain Res 664:155-166.

[0152] Beites, C L, Xie H, Bower R, and Trimble W S. (1999) The septin CDCrel-1 binds syntaxin and inhibits exocytosis. Nat Neurosci 2, 434-439.

[0153] Benedetti M, Levi A, Chao M V (1994) Differential expression of nerve growth factor receptors leads to altered binding affinity and neurotrophin responsiveness. J Neurosci 14:29-38.

[0154] Berg M M, Sternberg D W, Parada L F, Chao M V (1992) K-252a inhibits nerve growth factor-induced trk proto-oncogene tyrosine phosphorylation and kinase activity. J Biol Chem 267:13-16.

[0155] Bibel M, Hoppe E, Barde Y A (1999) Biochemical and functional interactions between the neurotrophin receptors trk and p75NTR. Development 126:2129-2140.

[0156] Bonhoeffer T (1996) Neurotrophins and activity-dependent development of the neocortex. Curr Opin Neurobiol 6:119-126.

[0157] Brennan C, Rivas-Plata K, Landis S C (1999) The p75 neurotrophin receptor influences NT-3 responsiveness of sympathetic neurons in vivo. Nat Neurosci 2:699-705.

[0158] Bruckner K, Klein R (1998) Signaling by Eph receptors and their ephrin ligands. Curr Opin Neurobiol 8:375-382.

[0159] Castellano A, Wei X, Birnbaumer L, and Perez-Reyes E. (1993) Cloning and expression of a third calcium channel beta subunit. J Biol Chem 268, 3450-3455.

[0160] Castellano A, Wei X, Birnbaumer L, and Perez-Reyes E. (1993) Cloning and expression of a neuronal calcium channel beta subunit. J Biol Chem 268, 12359-12366.

[0161] Caterina M J, Schumacher M A, Tominaga M, Rosen T A, Levine J D, Julius D (1997) The capsaicin receptor: a heat-activated ion channel in the pain pathway [see comments]. Nature 389:816-824.

[0162] Chao M V (1992a) Growth factor signaling: where is the specificity? Cell 68:995-997.

[0163] Chao M V (1992b) Neurotrophin receptors: a window into neuronal differentiation. Neuron 9:583-593.

[0164] Cowan C A, Yokoyama N, Bianchi L M, Henkemeyer M, Fritzsch B (2000) EphB2 guides axons at the midline and is necessary for normal vestibular function. Neuron 26:417-430.

[0165] Dong H, O'Brien R J, Fung E T, Lanahan A A, Worley P F, and Huganir R L. (1997) GRIP: a synaptic PDZ domain-containing protein that interacts with AMPA receptors. Nature 386, 279-284

[0166] Eshhar et al., (1990) Chimeric T cell receptor which incorporates the anti-tumour specificity of a monoclonal antibody with the cytolytic activity of T cells: a model system for immunotherapeutical approachBr. J. Cancer Suppl., 10:27-9

[0167] Farinas I, Wilkinson G A, Backus C, Reichardt L F, Patapoutian A (1998) Characterization of neurotrophin and Trk receptor functions in developing sensory ganglia: direct NT-3 activation of TrkB neurons in vivo. Neuron 21:325-334.

[0168] Flanagan J G, Vanderhaeghen P (1998) The ephrins and Eph receptors in neural development. Annu Rev Neurosci 21:309-345.

[0169] Frisen J, Holmberg J, Barbacid M (1999) Ephrins and their Eph receptors: multitalented directors of embryonic development. Embo J 18:5159-5165.

[0170] Frisen J, Verge V M, Cullheim S, Persson H, Fried K, Middlemas D S, Hunter T, Hokfelt T, Risling M (1992) Increased levels of trkB mRNA and trkB protein-like immunoreactivity in the injured rat and cat spinal cord. Proc Natl Acad Sci USA 89:11282-11286.

[0171] Gallo G, Lefcort F B, Letourneau P C (1997) The trkA receptor mediates growth cone turning toward a localized source of nerve growth factor. J Neurosci 17:5445-5454.

[0172] Gargano N, Levi A, Alema S (1997) Modulation of nerve growth factor internalization by direct interaction between p75 and TrkA receptors. J Neurosci Res 50:1-12.

[0173] Gross et al., (1989) Expression of immunoglobulin-T-cell receptor chimeric molecules as functional receptors with antibody-type specificity. Proc. Natl. Acad. Sci. USA, 86:10024-8.

[0174] Gundersen R W, Barrett J N (1979) Neuronal chemotaxis: chick dorsal-root axons turn toward high concentrations of nerve growth factor. Science 206:1079-1080.

[0175] Gyuris J, Golemis E, Chertkov H, Brent R (1993) Cdil, a human G1 and S phase protein phosphatase that associates with Cdk2. Cell 75:791-803.

[0176] Harteneck C, Plant T D, Schultz G (2000) From worm to man: three subfamilies of TRP channels. Trends Neurosci 23:159-166.

[0177] Hempstead B L, Rabin S J, Kaplan L, Reid S, Parada L F, Kaplan D R (1992) Overexpression of the trk tyrosine kinase rapidly accelerates nerve growth factor-induced differentiation. Neuron 9:883-896.

[0178] Holland S J, Gale N W, Gish G D, Roth R A, Songyang Z, Cantley L C, Henkemeyer M, Yancopoulos G D, Pawson T (1997) Juxtamembrane tyrosine residues couple the Eph family receptor EphB2/Nuk to specific SH2 domain proteins in neuronal cells. Embo J 16:3877-3888.

[0179] Holland S J, Peles E, Pawson T, Schlessinger J (1998) Cell-contact-dependent signalling in axon growth and guidance: Eph receptor tyrosine kinases and receptor protein tyrosine phosphatase beta. Curr Opin Neurobiol 8:117-127.

[0180] Huber L J, Chao M V (1995) A potential interaction of p75 and trkA NGF receptors revealed by affinity crosslinking and immunoprecipitation. J Neurosci Res 40:557-563.

[0181] Jia M, Li M, Liu X W, Jiang H et al. Voltage sensitive calcium currents are acutely increased by nerve growth factor in PC12 cells. J Neurophysiol 82, 2847 (1999)

[0182] Kafitz K W, Rose C R, Thoenen H, Konnerth A (1999) Neurotrophin-evoked rapid excitation through TrkB receptors [see comments]. Nature 401:918-921.

[0183] Kang H and Schuman E M. Long-lasting neurotrophin-induced enhancement of synaptic transmission in the adult hippocampus. Science 267, 1658 (1995)

[0184] Kaplan D R, Miller F D (2000) Neurotrophin signal transduction in the nervous system. Curr Opin Neurobiol 10:381-391.

[0185] Khursigara G, Orlinick J R, Chao M V (1999) Association of the p75 neurotrophin receptor with TRAF6. J Biol Chem 274:2597-2600.

[0186] Koizumi S, Contreras M L, Matsuda Y, Hama T, Lazarovici P, Guroff G (1988) K-252a: a specific inhibitor of the action of nerve growth factor on PC 12 cells. J Neurosci 8:715-721.

[0187] Korte M, Carroll P, Wolf E, Brem G et al. Hippocampal long-term potentiation is impaired in mice lacking brain-derived neurotrophic factor. Proc Natl Acad Sci 92, 8856 (1995).

[0188] Lai C, Lemke G (1991) An extended family of protein-tyrosine kinase genes differentially expressed in the vertebrate nervous system. Neuron 6:691-704.

[0189] Letourneau P C (1978) Chemotactic response of nerve fiber elongation to nerve growth factor. Dev Biol 66:183-196.

[0190] Levi-Montalcini R (1987) The nerve growth factor 35 years later. Science 237:1154-1162.

[0191] Lewin G R, Barde Y A (1996) Physiology of the neurotrophins. Annu Rev Neurosci 19:289-317.

[0192] McAllister A K, Katz L C, Lo D C (1999) Neurotrophins and synaptic plasticity. Annu Rev Neurosci 22:295-318.

[0193] Meakin S O, MacDonald J I, Gryz E A, Kubu C J, Verdi J M (1999) The signaling adapter FRS-2 competes with Shc for binding to the nerve growth factor receptor TrkA. A model for discriminating proliferation and differentiation. J Biol Chem 274:9861-9870.

[0194] Meinkoth et al, “Hybridization of nucleic acids immobilized on solid supports”, Anal Biochem 138:267-284 (1984)

[0195] Mellitzer G, Xu Q, Wilkinson D G (2000) Control of cell behaviour by signalling through Eph receptors and ephrins. Curr Opin Neurobiol 10:400-408.

[0196] Miller K G, Emerson M D, McManus J R, and Rand J B. (2000) RIC-8 (Synembryn): a novel conserved protein that is required for G_(q)α signaling in the C. elegans nervous system Neuron 27, 289-299.

[0197] Miranda J D, White L A, Marcillo A E, Willson C A, Jagid J, Whittemore S R (1999) Induction of Eph B3 after spinal cord injury. Exp Neurol 156:218-222.

[0198] Otsuka A J, Franco R, Yang B, Shim K H, Tang L Z, Zhang Y Y, Boontrakulpoontawee P, Jeyaprakash A, Hedgecock E, Wheaton V I, et al. (1995) An ankyrin-related gene (unc-44) is necessary for proper axonal guidance in Caenorhabditis elegans. J Cell Biol 129:1081-1092.

[0199] Patel T D, Jackman A, Rice F L, Kucera J, Snider W D (2000) Development of sensory neurons in the absence of NGF/TrkA signaling in vivo [see comments]. Neuron 25:345-357.

[0200] Paxinos G (1995) The Rat Nervous System, 2nd Edition. San Diego: Academic Press, Inc.

[0201] Pinon L G, Minichiello L, Klein R, Davies A M (1996) Timing of neuronal death in trkA, trkB and trkC mutant embryos reveals developmental changes in sensory neuron dependence on Trk signalling. Development 122:3255-3261.

[0202] Poo, Neurotrophins as synaptic modulators. Nature Rev Neurosci 2:24-31 (2001)

[0203] Pozzo-Miller L D, Gottschalk W, Zhang L et al. Impairments in high-frequency transmission, synaptic vesicle docking and synaptic protein distribution in the hippocampus of BDNF knockout mice. J Neuroscience 19, 4972 (1999)

[0204] Qian X, Riccio A, Zhang Y, Ginty D D (1998) Identification and characterization of novel substrates of Trk receptors in developing neurons. Neuron 21:1017-1029.

[0205] Ross A H, Daou M C, McKinnon C A, Condon P J, Lachyankar M B, Stephens R M, Kaplan D R, Wolf D E (1996) The neurotrophin receptor, gp75, forms a complex with the receptor tyrosine kinase TrkA. J Cell Biol 132:945-953.

[0206] Sala R, Viegi A, Rossi F M et al. Nerve growth factor and brain-derived neurotrophic factor increase neurotransmitter release in the rat visual cortex. Eur J Neurosci 10, 2185 (1998)

[0207] Sheng M & Sala C, PDZ domains and the organization of supramolecular complexes, Ann Rev Neurosci 24, 1-29, (2001)

[0208] Smith C A, Farrah T, Goodwin R G (1994) The TNF receptor superfamily of cellular and viral proteins: activation, costimulation, and death. Cell 76:959-962.

[0209] Snider W D (1994) Functions of the neurotrophins during nervous system development: what the knockouts are teaching us. Cell 77:627-638.

[0210] Srivastava S, Osten P, Vilim F S, Khatri L, Inman G, States B, Daly C, DeSouza S, Abagyan R, Valtschanoff J G, Weinberg R J, and Ziff E B (1998) Novel anchorage of GluR2/3 to the postsynaptic density by the AMPA receptor-binding protein ABP. Neuron 21, 581-591.

[0211] Thoenen H (1995) Neurotrophins and neuronal plasticity. Science 270:593-598.

[0212] Toledo-Aral J J, Brehm P, Halegoua S and Mandel G. A single pulse of nerve growth factor triggers long-term neuronal excitability through sodium channel gene induction. Neuron 14, 607 (1995).

[0213] Torres R, Firestein B L, Dong H, Staudinger J, Olson E N, Huganir R L, Bredt D S, Gale N W, Yancopoulos G D (1998) PDZ proteins bind, cluster, and synaptically colocalize with Eph receptors and their ephrin ligands [see comments]. Neuron 21:1453-1463.

[0214] Verdi J M, Birren S J, Ibanez C F, Persson H, Kaplan D R, Benedetti M, Chao M V, Anderson D J (1994) p75LNGFR regulates Trk signal transduction and NGF-induced neuronal differentiation in MAH cells. Neuron 12:733-745.

[0215] Welch M D, DePace A H, Verma S, Iwamatsu A, and Mitchison T J. (1997) The human Arp2/3 complex is composed of evolutionarily conserved subunits and is localized to cellular regions of dynamic actin filament assembly. J Cell Biol 138, 375-384.

[0216] Wright D E, Snider W D (1995) Neurotrophin receptor mRNA expression defines distinct populations of neurons in rat dorsal root ganglia. J Comp Neurol 351:329-338.

[0217] Yan Q, Johnson E M, Jr. (1988) An immunohistochemical study of the nerve growth factor receptor in developing rats. J Neurosci 8:3481-3498.

[0218] Yan Q, Radeke M J, Matheson C R, Talvenheimo J, Welcher A A, Feinstein S C (1997) Immunocytochemical localization of TrkB in the central nervous system of the adult rat [published erratum appears in J Comp Neurol 1997 June 16;382(4):546-7]. J Comp Neurol 378:135-157.

1 8 1 5148 DNA Rattus norvegicus CDS (1)..(5148) 1 atg tca gtt ctt ata tca cag agt gtg ata aat tat gtg gaa gaa gag 48 Met Ser Val Leu Ile Ser Gln Ser Val Ile Asn Tyr Val Glu Glu Glu 1 5 10 15 aat att cct gct ctg aaa gct ctt ctt gag aag tgc aaa gat gta gat 96 Asn Ile Pro Ala Leu Lys Ala Leu Leu Glu Lys Cys Lys Asp Val Asp 20 25 30 gag agg aac gag tgt ggc cag acg ccg ctg atg ctg gct gcg gag cag 144 Glu Arg Asn Glu Cys Gly Gln Thr Pro Leu Met Leu Ala Ala Glu Gln 35 40 45 ggc aat gtg gaa ata gtg aaa gaa ctg tta aag aat gga gcc aac tgc 192 Gly Asn Val Glu Ile Val Lys Glu Leu Leu Lys Asn Gly Ala Asn Cys 50 55 60 aat ctg gaa gat ctg gat aac tgg act gcc ctt ata tcg gca tct aag 240 Asn Leu Glu Asp Leu Asp Asn Trp Thr Ala Leu Ile Ser Ala Ser Lys 65 70 75 80 gag ggg cac atc cac att gtg gag gag ctg ctg aag agc ggg gcc agt 288 Glu Gly His Ile His Ile Val Glu Glu Leu Leu Lys Ser Gly Ala Ser 85 90 95 ctg gag cac cga gac atg ggt gga tgg aca gct ctc atg tgg gcg tgc 336 Leu Glu His Arg Asp Met Gly Gly Trp Thr Ala Leu Met Trp Ala Cys 100 105 110 tac aaa ggc agg acc gac gtg gtc gag ctg ctt ctt tct cat ggt gcc 384 Tyr Lys Gly Arg Thr Asp Val Val Glu Leu Leu Leu Ser His Gly Ala 115 120 125 aac cca agt gtc act ggt ttg cag tat agt gtc tac cca atc atc tgg 432 Asn Pro Ser Val Thr Gly Leu Gln Tyr Ser Val Tyr Pro Ile Ile Trp 130 135 140 gca gcg ggc aga ggc cat gcc gac ata gtg cat ctt cta ctg cag aat 480 Ala Ala Gly Arg Gly His Ala Asp Ile Val His Leu Leu Leu Gln Asn 145 150 155 160 ggt gcg aaa gtc aat tgc tcc gac aag tac gga aca acc ccc ttg gta 528 Gly Ala Lys Val Asn Cys Ser Asp Lys Tyr Gly Thr Thr Pro Leu Val 165 170 175 tgg gct gcc cga aag ggt cat ttg gaa tgt gtg aag cac ttg ttg gcc 576 Trp Ala Ala Arg Lys Gly His Leu Glu Cys Val Lys His Leu Leu Ala 180 185 190 atg gga gct gat gtt gat caa gaa gga gcc aat tcg atg act gca ctt 624 Met Gly Ala Asp Val Asp Gln Glu Gly Ala Asn Ser Met Thr Ala Leu 195 200 205 atc gtg gca gtg aaa gga ggc tac aca cag tcc gtg aaa gag att ttg 672 Ile Val Ala Val Lys Gly Gly Tyr Thr Gln Ser Val Lys Glu Ile Leu 210 215 220 aaa agg aat cca aat gtg aac cta act gac aag gat gga aac aca gct 720 Lys Arg Asn Pro Asn Val Asn Leu Thr Asp Lys Asp Gly Asn Thr Ala 225 230 235 240 ctg atg atc gca tcg aag gag gga cat att gag att gtg cag gac ctg 768 Leu Met Ile Ala Ser Lys Glu Gly His Ile Glu Ile Val Gln Asp Leu 245 250 255 ctg gac gcc gga aca tac gtg aac ata cct gac agg agt ggg gat act 816 Leu Asp Ala Gly Thr Tyr Val Asn Ile Pro Asp Arg Ser Gly Asp Thr 260 265 270 gta ttg atc ggc gct gtg aga ggt ggt cat gtt gaa att gtt cga gca 864 Val Leu Ile Gly Ala Val Arg Gly Gly His Val Glu Ile Val Arg Ala 275 280 285 ctt ctc caa aaa tat gct gat ata gac att aga ggg cag gac aat aag 912 Leu Leu Gln Lys Tyr Ala Asp Ile Asp Ile Arg Gly Gln Asp Asn Lys 290 295 300 acc gcg ctg tat tgg gcc gtt gaa aag gga aat gcg aca atg gtg agg 960 Thr Ala Leu Tyr Trp Ala Val Glu Lys Gly Asn Ala Thr Met Val Arg 305 310 315 320 gat atc ctg cag tgc aat ccg gac acg gaa ata tgc acg aag gat ggt 1008 Asp Ile Leu Gln Cys Asn Pro Asp Thr Glu Ile Cys Thr Lys Asp Gly 325 330 335 gag aca cca ctg ata aaa gcc acc aag atg aga aat att gaa gtg gtg 1056 Glu Thr Pro Leu Ile Lys Ala Thr Lys Met Arg Asn Ile Glu Val Val 340 345 350 gag ctg ctg ctg gat aag ggg gct aag gtg tct gct gtg gac aag aaa 1104 Glu Leu Leu Leu Asp Lys Gly Ala Lys Val Ser Ala Val Asp Lys Lys 355 360 365 ggg gac act ccc ctg cac gtt gcc atc cga ggg agg agc cgg aga ctg 1152 Gly Asp Thr Pro Leu His Val Ala Ile Arg Gly Arg Ser Arg Arg Leu 370 375 380 gcc gaa ctt ctt tta cgg aac ccc aaa gat gga cgg tta ctc tac aga 1200 Ala Glu Leu Leu Leu Arg Asn Pro Lys Asp Gly Arg Leu Leu Tyr Arg 385 390 395 400 ccc aac aaa gca ggc gag act ccc tac aac att gac tgc agc cac cag 1248 Pro Asn Lys Ala Gly Glu Thr Pro Tyr Asn Ile Asp Cys Ser His Gln 405 410 415 aaa agc att tta act caa ata ttt gga gcc aga cac ttg tct cct acg 1296 Lys Ser Ile Leu Thr Gln Ile Phe Gly Ala Arg His Leu Ser Pro Thr 420 425 430 gaa aca gat ggc gac atg ctt ggt tat gat ttg tat agc agt gcc ctg 1344 Glu Thr Asp Gly Asp Met Leu Gly Tyr Asp Leu Tyr Ser Ser Ala Leu 435 440 445 gcc gat att ctc agt gaa ccc acc atg caa cca ccc att tgt gtg ggc 1392 Ala Asp Ile Leu Ser Glu Pro Thr Met Gln Pro Pro Ile Cys Val Gly 450 455 460 ttg tat gca cag tgg gga agt ggg aag tct ttc tta ctc aag aaa cta 1440 Leu Tyr Ala Gln Trp Gly Ser Gly Lys Ser Phe Leu Leu Lys Lys Leu 465 470 475 480 gaa gat gag atg aag acg ttt gca gga cag cag acc gag ccc ctc ttc 1488 Glu Asp Glu Met Lys Thr Phe Ala Gly Gln Gln Thr Glu Pro Leu Phe 485 490 495 cag ttc tcg tgg ctc ata gtg ttc ctc acc ctg ctg ctg tgt ggc ggg 1536 Gln Phe Ser Trp Leu Ile Val Phe Leu Thr Leu Leu Leu Cys Gly Gly 500 505 510 ctc ggc cta gtg ttt gct ttc ccg gtt gat aca aac ctc gcc ata gcc 1584 Leu Gly Leu Val Phe Ala Phe Pro Val Asp Thr Asn Leu Ala Ile Ala 515 520 525 att tca ttg agt ttc ctg gct ctc ata tat ata ttc ttc atc gtc att 1632 Ile Ser Leu Ser Phe Leu Ala Leu Ile Tyr Ile Phe Phe Ile Val Ile 530 535 540 tac ttc ggt gga cgg cgg gaa gga gag agt tgg aac tgg gcc tgg gct 1680 Tyr Phe Gly Gly Arg Arg Glu Gly Glu Ser Trp Asn Trp Ala Trp Ala 545 550 555 560 ctc agt acc aga ctg gca aga cac att ggg tac ctg gag ctc ctc ttc 1728 Leu Ser Thr Arg Leu Ala Arg His Ile Gly Tyr Leu Glu Leu Leu Phe 565 570 575 aaa ctg atg ttt gtg aac cca cct gaa ctg ccg gag caa act aca aag 1776 Lys Leu Met Phe Val Asn Pro Pro Glu Leu Pro Glu Gln Thr Thr Lys 580 585 590 gct tta cct gtg agg ttt ttg ttt aca gat tac aat aga ctc tcc agt 1824 Ala Leu Pro Val Arg Phe Leu Phe Thr Asp Tyr Asn Arg Leu Ser Ser 595 600 605 gtg gga gga gaa act tcc ttg gct gaa atg atc gcg acc ctc tca gat 1872 Val Gly Gly Glu Thr Ser Leu Ala Glu Met Ile Ala Thr Leu Ser Asp 610 615 620 gct tgt gag aga gaa ttc ggc ttt ttg gct acc cgt ctt ttt cga gtc 1920 Ala Cys Glu Arg Glu Phe Gly Phe Leu Ala Thr Arg Leu Phe Arg Val 625 630 635 640 ttc agg act gaa gag tct cag ggt aaa aag aaa tgg aaa aaa aca tgc 1968 Phe Arg Thr Glu Glu Ser Gln Gly Lys Lys Lys Trp Lys Lys Thr Cys 645 650 655 tgc ctc ccc tct ttt gtc atc ttt ctt ttt att gtt ggc tgc att att 2016 Cys Leu Pro Ser Phe Val Ile Phe Leu Phe Ile Val Gly Cys Ile Ile 660 665 670 gct ggg atc acc ctt ctg gct ata ttc aga gtt gac ccg aaa cat ctg 2064 Ala Gly Ile Thr Leu Leu Ala Ile Phe Arg Val Asp Pro Lys His Leu 675 680 685 aca gtg aat gct atc ctc att tct atc gca tct gtc gtg ggg ttg gcc 2112 Thr Val Asn Ala Ile Leu Ile Ser Ile Ala Ser Val Val Gly Leu Ala 690 695 700 ttc gtg ctc aat tgt cga aca tgg tgg caa gtg ctg gac tct ctc ctg 2160 Phe Val Leu Asn Cys Arg Thr Trp Trp Gln Val Leu Asp Ser Leu Leu 705 710 715 720 aat tct cag aga aaa cgc ctc cat agt gct gcc tcc aaa tta cac aag 2208 Asn Ser Gln Arg Lys Arg Leu His Ser Ala Ala Ser Lys Leu His Lys 725 730 735 tta aaa agt gaa gga ttc atg aaa gtc ctg aag tgt gag gtg gag ctg 2256 Leu Lys Ser Glu Gly Phe Met Lys Val Leu Lys Cys Glu Val Glu Leu 740 745 750 atg gcc agg atg gca aaa acc att gac agc ttc act caa aac cag aca 2304 Met Ala Arg Met Ala Lys Thr Ile Asp Ser Phe Thr Gln Asn Gln Thr 755 760 765 agg ctg gtg gtt atc att gac ggc ctg gat gcc tgc gag cag gat aaa 2352 Arg Leu Val Val Ile Ile Asp Gly Leu Asp Ala Cys Glu Gln Asp Lys 770 775 780 gtc ctt cag atg ctg gac act gtc cga gtt ctg ttt tca aag ggc cct 2400 Val Leu Gln Met Leu Asp Thr Val Arg Val Leu Phe Ser Lys Gly Pro 785 790 795 800 ttt att gcc att ttt gca agt gat ccg cat att atc ata aag gcc atc 2448 Phe Ile Ala Ile Phe Ala Ser Asp Pro His Ile Ile Ile Lys Ala Ile 805 810 815 aac cag aac ctg aat agt gtg ctt cgt gac tca aat ata aac gga cat 2496 Asn Gln Asn Leu Asn Ser Val Leu Arg Asp Ser Asn Ile Asn Gly His 820 825 830 gac tac atg cgc aat ata gtt cac tta cca gtt ttt ctt aat agt cgt 2544 Asp Tyr Met Arg Asn Ile Val His Leu Pro Val Phe Leu Asn Ser Arg 835 840 845 ggg ctt agc aat gca agg aaa ttc ctt gta act tca gca aca aat ggg 2592 Gly Leu Ser Asn Ala Arg Lys Phe Leu Val Thr Ser Ala Thr Asn Gly 850 855 860 gac att aca tgc tca gac acc aca gga aca cag gag gac act gac aga 2640 Asp Ile Thr Cys Ser Asp Thr Thr Gly Thr Gln Glu Asp Thr Asp Arg 865 870 875 880 aga gtt tca cag aac agc ctt ggg gag atg aca aag ctt ggg agc aaa 2688 Arg Val Ser Gln Asn Ser Leu Gly Glu Met Thr Lys Leu Gly Ser Lys 885 890 895 acg gct ctc aac aga agg gac act tac cgc aga aga cag atg cag aga 2736 Thr Ala Leu Asn Arg Arg Asp Thr Tyr Arg Arg Arg Gln Met Gln Arg 900 905 910 acc atc acg agg cag atg tcc ttt gat ctc aca aag ctg ctg gtc act 2784 Thr Ile Thr Arg Gln Met Ser Phe Asp Leu Thr Lys Leu Leu Val Thr 915 920 925 gag gat tgg ttc agt gac atc agc cct cag acc atg aga cgg cta ctc 2832 Glu Asp Trp Phe Ser Asp Ile Ser Pro Gln Thr Met Arg Arg Leu Leu 930 935 940 aat att gtt tct gtg aca ggg cgg cta cta aga gcc aat cag att act 2880 Asn Ile Val Ser Val Thr Gly Arg Leu Leu Arg Ala Asn Gln Ile Thr 945 950 955 960 ttc aac tgg gac agg cta gcc agc tgg atc aac ctc acc gag cag tgg 2928 Phe Asn Trp Asp Arg Leu Ala Ser Trp Ile Asn Leu Thr Glu Gln Trp 965 970 975 ccg tac cgg acg tct tgg ctt ata cta tat ctg gaa gag act gaa ggt 2976 Pro Tyr Arg Thr Ser Trp Leu Ile Leu Tyr Leu Glu Glu Thr Glu Gly 980 985 990 ctc cct gat cag atg acc tta aag acc atg tat gag aga ata tcg aag 3024 Leu Pro Asp Gln Met Thr Leu Lys Thr Met Tyr Glu Arg Ile Ser Lys 995 1000 1005 aat atc cca aca act aaa gat gtt gag ccg ctt ctt gaa att gat 3069 Asn Ile Pro Thr Thr Lys Asp Val Glu Pro Leu Leu Glu Ile Asp 1010 1015 1020 gga gat ata aga aac ttt gaa gtc ttc ttg tct tca agg act cca 3114 Gly Asp Ile Arg Asn Phe Glu Val Phe Leu Ser Ser Arg Thr Pro 1025 1030 1035 gtt ctc gtg gct cga gat gta aag acc ttt ttg ccg tgc aca gta 3159 Val Leu Val Ala Arg Asp Val Lys Thr Phe Leu Pro Cys Thr Val 1040 1045 1050 aac ctg gat ccc aaa ctt cgg gag atc atc gca gat gtc cga gct 3204 Asn Leu Asp Pro Lys Leu Arg Glu Ile Ile Ala Asp Val Arg Ala 1055 1060 1065 gca aga gag cag atc aac ata gga ggc ctg gcc tat ccc ccg ctc 3249 Ala Arg Glu Gln Ile Asn Ile Gly Gly Leu Ala Tyr Pro Pro Leu 1070 1075 1080 cct ctg cac gaa ggc cct cct cgg cca cct tct ggg tac agt cag 3294 Pro Leu His Glu Gly Pro Pro Arg Pro Pro Ser Gly Tyr Ser Gln 1085 1090 1095 cct gca tct gtc tgc tct tca gca tct ttc aac ggg ccg ttc cca 3339 Pro Ala Ser Val Cys Ser Ser Ala Ser Phe Asn Gly Pro Phe Pro 1100 1105 1110 ggc ggg gtt gtg tct cca cag ccc cac agc agt tac tac agc ggc 3384 Gly Gly Val Val Ser Pro Gln Pro His Ser Ser Tyr Tyr Ser Gly 1115 1120 1125 ttg tcc gga ccc cag cac ccc ttc tac aac agg gca gcc gtc cct 3429 Leu Ser Gly Pro Gln His Pro Phe Tyr Asn Arg Ala Ala Val Pro 1130 1135 1140 gcc aca ggc agc tca ctg tta ctg agt tca atg acc gtg gat gtc 3474 Ala Thr Gly Ser Ser Leu Leu Leu Ser Ser Met Thr Val Asp Val 1145 1150 1155 gta tgt gag aaa ctg aga cag ata gaa ggg ctg gac cag aac atg 3519 Val Cys Glu Lys Leu Arg Gln Ile Glu Gly Leu Asp Gln Asn Met 1160 1165 1170 atg cct cag tac tgc aca aca atc aaa aag gcg aac ata aat ggc 3564 Met Pro Gln Tyr Cys Thr Thr Ile Lys Lys Ala Asn Ile Asn Gly 1175 1180 1185 cgg gta ttg tct cag tgt aac att gat gag ctg aag aaa gag atg 3609 Arg Val Leu Ser Gln Cys Asn Ile Asp Glu Leu Lys Lys Glu Met 1190 1195 1200 gct atg aac ttt gga gac tgg cat ctc ttc aga agc atg gtc tta 3654 Ala Met Asn Phe Gly Asp Trp His Leu Phe Arg Ser Met Val Leu 1205 1210 1215 gaa atg aga agt gtg gaa agc cag gtg gtc cct gaa gac cct cgt 3699 Glu Met Arg Ser Val Glu Ser Gln Val Val Pro Glu Asp Pro Arg 1220 1225 1230 ttt ctc aat gaa aac tcc agt gcc cca gtt cct cat gga gag tct 3744 Phe Leu Asn Glu Asn Ser Ser Ala Pro Val Pro His Gly Glu Ser 1235 1240 1245 gct cgc cgc tct tct cac act gag ctg ccc ctc acg gag ctt tcc 3789 Ala Arg Arg Ser Ser His Thr Glu Leu Pro Leu Thr Glu Leu Ser 1250 1255 1260 agc cag act cca tac acc ctg aac ttc agc ttt gaa gaa ctg aac 3834 Ser Gln Thr Pro Tyr Thr Leu Asn Phe Ser Phe Glu Glu Leu Asn 1265 1270 1275 acg ctt ggc ctt gat gag gga gcc cct agg cac agt aac ctg agt 3879 Thr Leu Gly Leu Asp Glu Gly Ala Pro Arg His Ser Asn Leu Ser 1280 1285 1290 tgg cag tca caa act cgc aga acc cca agt ctc tcg agt ctc aat 3924 Trp Gln Ser Gln Thr Arg Arg Thr Pro Ser Leu Ser Ser Leu Asn 1295 1300 1305 tcc cag gac tcc agt att gaa att tca aag ctt act gat aag gtg 3969 Ser Gln Asp Ser Ser Ile Glu Ile Ser Lys Leu Thr Asp Lys Val 1310 1315 1320 cag gcc gag tat aga gac gcc tat aga gag tac att gct cag atg 4014 Gln Ala Glu Tyr Arg Asp Ala Tyr Arg Glu Tyr Ile Ala Gln Met 1325 1330 1335 tcc cag tta gaa ggg ggc aca ggg tcg tcc aca ata agc ggc aga 4059 Ser Gln Leu Glu Gly Gly Thr Gly Ser Ser Thr Ile Ser Gly Arg 1340 1345 1350 tct tct cca cat agc aca tac tac ata ggc cag agc tct tca ggg 4104 Ser Ser Pro His Ser Thr Tyr Tyr Ile Gly Gln Ser Ser Ser Gly 1355 1360 1365 ggc tcc atc cat tct act cta gaa cag gaa agg ggg aag gag ggt 4149 Gly Ser Ile His Ser Thr Leu Glu Gln Glu Arg Gly Lys Glu Gly 1370 1375 1380 gag cta aag cag gag gat ggg cgg aag tca ttt tta atg aag agg 4194 Glu Leu Lys Gln Glu Asp Gly Arg Lys Ser Phe Leu Met Lys Arg 1385 1390 1395 ggg gat gtc ata gat tac tcc tca tca ggg gtg tcc act aat gag 4239 Gly Asp Val Ile Asp Tyr Ser Ser Ser Gly Val Ser Thr Asn Glu 1400 1405 1410 gcc tca ccc ctg gac ccc att act gaa gaa gat gag aaa tcc gac 4284 Ala Ser Pro Leu Asp Pro Ile Thr Glu Glu Asp Glu Lys Ser Asp 1415 1420 1425 cag tca ggt agt aag cta ctc cca ggc aag aaa tcc tca gaa cgg 4329 Gln Ser Gly Ser Lys Leu Leu Pro Gly Lys Lys Ser Ser Glu Arg 1430 1435 1440 ccc agt ctc ttc cag aca gac ttg aag ctt aag ggg ggt ggt ctt 4374 Pro Ser Leu Phe Gln Thr Asp Leu Lys Leu Lys Gly Gly Gly Leu 1445 1450 1455 cgc tac cag aaa ctg ccc agt gat gaa gat gag tcg ggg aca gga 4419 Arg Tyr Gln Lys Leu Pro Ser Asp Glu Asp Glu Ser Gly Thr Gly 1460 1465 1470 aga gtt cag ata aca ccc cac tgc tca aag atg ata agg aca aaa 4464 Arg Val Gln Ile Thr Pro His Cys Ser Lys Met Ile Arg Thr Lys 1475 1480 1485 agg ctg aag gca aag cag aga gag tgt gca agt ccc cag gag cac 4509 Arg Leu Lys Ala Lys Gln Arg Glu Cys Ala Ser Pro Gln Glu His 1490 1495 1500 agt gcc gag ccc atc aga acg ttc att aaa gca aaa gag tat ttg 4554 Ser Ala Glu Pro Ile Arg Thr Phe Ile Lys Ala Lys Glu Tyr Leu 1505 1510 1515 tcc gat gcc ctc ctt gac aaa aag gat tcc tca gat tcg gga gtg 4599 Ser Asp Ala Leu Leu Asp Lys Lys Asp Ser Ser Asp Ser Gly Val 1520 1525 1530 agg tcc aat gag agt tct ccc aac cac tct ctc cac aat gag gca 4644 Arg Ser Asn Glu Ser Ser Pro Asn His Ser Leu His Asn Glu Ala 1535 1540 1545 gca gac gac tcc caa ctg gaa aag gca aac ctc ata gag ctt gaa 4689 Ala Asp Asp Ser Gln Leu Glu Lys Ala Asn Leu Ile Glu Leu Glu 1550 1555 1560 gat gaa ggc cac agt ggg aag cgg ggg atg cca cac agt ctg agt 4734 Asp Glu Gly His Ser Gly Lys Arg Gly Met Pro His Ser Leu Ser 1565 1570 1575 ggc ttg caa gat cca att ata gct cgg atg tcc att tgc tcg gaa 4779 Gly Leu Gln Asp Pro Ile Ile Ala Arg Met Ser Ile Cys Ser Glu 1580 1585 1590 gac aag aaa agc cct tcc gag tgc agc ctg att gct agc agc cct 4824 Asp Lys Lys Ser Pro Ser Glu Cys Ser Leu Ile Ala Ser Ser Pro 1595 1600 1605 gaa gaa agc tgg cct gcg tgc cag aag gcc tac aat cta aac cga 4869 Glu Glu Ser Trp Pro Ala Cys Gln Lys Ala Tyr Asn Leu Asn Arg 1610 1615 1620 acg ccc agc act gtg act ctg aac aac aac act gca ccc acc aac 4914 Thr Pro Ser Thr Val Thr Leu Asn Asn Asn Thr Ala Pro Thr Asn 1625 1630 1635 aga gcc aat cag aac ttc gat gag ata gag gga atc agg gag acg 4959 Arg Ala Asn Gln Asn Phe Asp Glu Ile Glu Gly Ile Arg Glu Thr 1640 1645 1650 tcc cag gtc att ctg agg cct ggc ccc agt ccc aac cca act gct 5004 Ser Gln Val Ile Leu Arg Pro Gly Pro Ser Pro Asn Pro Thr Ala 1655 1660 1665 gtg cag aat gaa aac ctg aag agc atg gcg cac aag cga agc cag 5049 Val Gln Asn Glu Asn Leu Lys Ser Met Ala His Lys Arg Ser Gln 1670 1675 1680 cgc tcc agt tac aca agg ctc tcg aaa gat gca tct gag cta cat 5094 Arg Ser Ser Tyr Thr Arg Leu Ser Lys Asp Ala Ser Glu Leu His 1685 1690 1695 gca gcc tcc tcg gag agc aca ggc ttt gga gaa gag aga gag agc 5139 Ala Ala Ser Ser Glu Ser Thr Gly Phe Gly Glu Glu Arg Glu Ser 1700 1705 1710 att ctc tga 5148 Ile Leu 1715 2 1715 PRT Rattus norvegicus 2 Met Ser Val Leu Ile Ser Gln Ser Val Ile Asn Tyr Val Glu Glu Glu 1 5 10 15 Asn Ile Pro Ala Leu Lys Ala Leu Leu Glu Lys Cys Lys Asp Val Asp 20 25 30 Glu Arg Asn Glu Cys Gly Gln Thr Pro Leu Met Leu Ala Ala Glu Gln 35 40 45 Gly Asn Val Glu Ile Val Lys Glu Leu Leu Lys Asn Gly Ala Asn Cys 50 55 60 Asn Leu Glu Asp Leu Asp Asn Trp Thr Ala Leu Ile Ser Ala Ser Lys 65 70 75 80 Glu Gly His Ile His Ile Val Glu Glu Leu Leu Lys Ser Gly Ala Ser 85 90 95 Leu Glu His Arg Asp Met Gly Gly Trp Thr Ala Leu Met Trp Ala Cys 100 105 110 Tyr Lys Gly Arg Thr Asp Val Val Glu Leu Leu Leu Ser His Gly Ala 115 120 125 Asn Pro Ser Val Thr Gly Leu Gln Tyr Ser Val Tyr Pro Ile Ile Trp 130 135 140 Ala Ala Gly Arg Gly His Ala Asp Ile Val His Leu Leu Leu Gln Asn 145 150 155 160 Gly Ala Lys Val Asn Cys Ser Asp Lys Tyr Gly Thr Thr Pro Leu Val 165 170 175 Trp Ala Ala Arg Lys Gly His Leu Glu Cys Val Lys His Leu Leu Ala 180 185 190 Met Gly Ala Asp Val Asp Gln Glu Gly Ala Asn Ser Met Thr Ala Leu 195 200 205 Ile Val Ala Val Lys Gly Gly Tyr Thr Gln Ser Val Lys Glu Ile Leu 210 215 220 Lys Arg Asn Pro Asn Val Asn Leu Thr Asp Lys Asp Gly Asn Thr Ala 225 230 235 240 Leu Met Ile Ala Ser Lys Glu Gly His Ile Glu Ile Val Gln Asp Leu 245 250 255 Leu Asp Ala Gly Thr Tyr Val Asn Ile Pro Asp Arg Ser Gly Asp Thr 260 265 270 Val Leu Ile Gly Ala Val Arg Gly Gly His Val Glu Ile Val Arg Ala 275 280 285 Leu Leu Gln Lys Tyr Ala Asp Ile Asp Ile Arg Gly Gln Asp Asn Lys 290 295 300 Thr Ala Leu Tyr Trp Ala Val Glu Lys Gly Asn Ala Thr Met Val Arg 305 310 315 320 Asp Ile Leu Gln Cys Asn Pro Asp Thr Glu Ile Cys Thr Lys Asp Gly 325 330 335 Glu Thr Pro Leu Ile Lys Ala Thr Lys Met Arg Asn Ile Glu Val Val 340 345 350 Glu Leu Leu Leu Asp Lys Gly Ala Lys Val Ser Ala Val Asp Lys Lys 355 360 365 Gly Asp Thr Pro Leu His Val Ala Ile Arg Gly Arg Ser Arg Arg Leu 370 375 380 Ala Glu Leu Leu Leu Arg Asn Pro Lys Asp Gly Arg Leu Leu Tyr Arg 385 390 395 400 Pro Asn Lys Ala Gly Glu Thr Pro Tyr Asn Ile Asp Cys Ser His Gln 405 410 415 Lys Ser Ile Leu Thr Gln Ile Phe Gly Ala Arg His Leu Ser Pro Thr 420 425 430 Glu Thr Asp Gly Asp Met Leu Gly Tyr Asp Leu Tyr Ser Ser Ala Leu 435 440 445 Ala Asp Ile Leu Ser Glu Pro Thr Met Gln Pro Pro Ile Cys Val Gly 450 455 460 Leu Tyr Ala Gln Trp Gly Ser Gly Lys Ser Phe Leu Leu Lys Lys Leu 465 470 475 480 Glu Asp Glu Met Lys Thr Phe Ala Gly Gln Gln Thr Glu Pro Leu Phe 485 490 495 Gln Phe Ser Trp Leu Ile Val Phe Leu Thr Leu Leu Leu Cys Gly Gly 500 505 510 Leu Gly Leu Val Phe Ala Phe Pro Val Asp Thr Asn Leu Ala Ile Ala 515 520 525 Ile Ser Leu Ser Phe Leu Ala Leu Ile Tyr Ile Phe Phe Ile Val Ile 530 535 540 Tyr Phe Gly Gly Arg Arg Glu Gly Glu Ser Trp Asn Trp Ala Trp Ala 545 550 555 560 Leu Ser Thr Arg Leu Ala Arg His Ile Gly Tyr Leu Glu Leu Leu Phe 565 570 575 Lys Leu Met Phe Val Asn Pro Pro Glu Leu Pro Glu Gln Thr Thr Lys 580 585 590 Ala Leu Pro Val Arg Phe Leu Phe Thr Asp Tyr Asn Arg Leu Ser Ser 595 600 605 Val Gly Gly Glu Thr Ser Leu Ala Glu Met Ile Ala Thr Leu Ser Asp 610 615 620 Ala Cys Glu Arg Glu Phe Gly Phe Leu Ala Thr Arg Leu Phe Arg Val 625 630 635 640 Phe Arg Thr Glu Glu Ser Gln Gly Lys Lys Lys Trp Lys Lys Thr Cys 645 650 655 Cys Leu Pro Ser Phe Val Ile Phe Leu Phe Ile Val Gly Cys Ile Ile 660 665 670 Ala Gly Ile Thr Leu Leu Ala Ile Phe Arg Val Asp Pro Lys His Leu 675 680 685 Thr Val Asn Ala Ile Leu Ile Ser Ile Ala Ser Val Val Gly Leu Ala 690 695 700 Phe Val Leu Asn Cys Arg Thr Trp Trp Gln Val Leu Asp Ser Leu Leu 705 710 715 720 Asn Ser Gln Arg Lys Arg Leu His Ser Ala Ala Ser Lys Leu His Lys 725 730 735 Leu Lys Ser Glu Gly Phe Met Lys Val Leu Lys Cys Glu Val Glu Leu 740 745 750 Met Ala Arg Met Ala Lys Thr Ile Asp Ser Phe Thr Gln Asn Gln Thr 755 760 765 Arg Leu Val Val Ile Ile Asp Gly Leu Asp Ala Cys Glu Gln Asp Lys 770 775 780 Val Leu Gln Met Leu Asp Thr Val Arg Val Leu Phe Ser Lys Gly Pro 785 790 795 800 Phe Ile Ala Ile Phe Ala Ser Asp Pro His Ile Ile Ile Lys Ala Ile 805 810 815 Asn Gln Asn Leu Asn Ser Val Leu Arg Asp Ser Asn Ile Asn Gly His 820 825 830 Asp Tyr Met Arg Asn Ile Val His Leu Pro Val Phe Leu Asn Ser Arg 835 840 845 Gly Leu Ser Asn Ala Arg Lys Phe Leu Val Thr Ser Ala Thr Asn Gly 850 855 860 Asp Ile Thr Cys Ser Asp Thr Thr Gly Thr Gln Glu Asp Thr Asp Arg 865 870 875 880 Arg Val Ser Gln Asn Ser Leu Gly Glu Met Thr Lys Leu Gly Ser Lys 885 890 895 Thr Ala Leu Asn Arg Arg Asp Thr Tyr Arg Arg Arg Gln Met Gln Arg 900 905 910 Thr Ile Thr Arg Gln Met Ser Phe Asp Leu Thr Lys Leu Leu Val Thr 915 920 925 Glu Asp Trp Phe Ser Asp Ile Ser Pro Gln Thr Met Arg Arg Leu Leu 930 935 940 Asn Ile Val Ser Val Thr Gly Arg Leu Leu Arg Ala Asn Gln Ile Thr 945 950 955 960 Phe Asn Trp Asp Arg Leu Ala Ser Trp Ile Asn Leu Thr Glu Gln Trp 965 970 975 Pro Tyr Arg Thr Ser Trp Leu Ile Leu Tyr Leu Glu Glu Thr Glu Gly 980 985 990 Leu Pro Asp Gln Met Thr Leu Lys Thr Met Tyr Glu Arg Ile Ser Lys 995 1000 1005 Asn Ile Pro Thr Thr Lys Asp Val Glu Pro Leu Leu Glu Ile Asp 1010 1015 1020 Gly Asp Ile Arg Asn Phe Glu Val Phe Leu Ser Ser Arg Thr Pro 1025 1030 1035 Val Leu Val Ala Arg Asp Val Lys Thr Phe Leu Pro Cys Thr Val 1040 1045 1050 Asn Leu Asp Pro Lys Leu Arg Glu Ile Ile Ala Asp Val Arg Ala 1055 1060 1065 Ala Arg Glu Gln Ile Asn Ile Gly Gly Leu Ala Tyr Pro Pro Leu 1070 1075 1080 Pro Leu His Glu Gly Pro Pro Arg Pro Pro Ser Gly Tyr Ser Gln 1085 1090 1095 Pro Ala Ser Val Cys Ser Ser Ala Ser Phe Asn Gly Pro Phe Pro 1100 1105 1110 Gly Gly Val Val Ser Pro Gln Pro His Ser Ser Tyr Tyr Ser Gly 1115 1120 1125 Leu Ser Gly Pro Gln His Pro Phe Tyr Asn Arg Ala Ala Val Pro 1130 1135 1140 Ala Thr Gly Ser Ser Leu Leu Leu Ser Ser Met Thr Val Asp Val 1145 1150 1155 Val Cys Glu Lys Leu Arg Gln Ile Glu Gly Leu Asp Gln Asn Met 1160 1165 1170 Met Pro Gln Tyr Cys Thr Thr Ile Lys Lys Ala Asn Ile Asn Gly 1175 1180 1185 Arg Val Leu Ser Gln Cys Asn Ile Asp Glu Leu Lys Lys Glu Met 1190 1195 1200 Ala Met Asn Phe Gly Asp Trp His Leu Phe Arg Ser Met Val Leu 1205 1210 1215 Glu Met Arg Ser Val Glu Ser Gln Val Val Pro Glu Asp Pro Arg 1220 1225 1230 Phe Leu Asn Glu Asn Ser Ser Ala Pro Val Pro His Gly Glu Ser 1235 1240 1245 Ala Arg Arg Ser Ser His Thr Glu Leu Pro Leu Thr Glu Leu Ser 1250 1255 1260 Ser Gln Thr Pro Tyr Thr Leu Asn Phe Ser Phe Glu Glu Leu Asn 1265 1270 1275 Thr Leu Gly Leu Asp Glu Gly Ala Pro Arg His Ser Asn Leu Ser 1280 1285 1290 Trp Gln Ser Gln Thr Arg Arg Thr Pro Ser Leu Ser Ser Leu Asn 1295 1300 1305 Ser Gln Asp Ser Ser Ile Glu Ile Ser Lys Leu Thr Asp Lys Val 1310 1315 1320 Gln Ala Glu Tyr Arg Asp Ala Tyr Arg Glu Tyr Ile Ala Gln Met 1325 1330 1335 Ser Gln Leu Glu Gly Gly Thr Gly Ser Ser Thr Ile Ser Gly Arg 1340 1345 1350 Ser Ser Pro His Ser Thr Tyr Tyr Ile Gly Gln Ser Ser Ser Gly 1355 1360 1365 Gly Ser Ile His Ser Thr Leu Glu Gln Glu Arg Gly Lys Glu Gly 1370 1375 1380 Glu Leu Lys Gln Glu Asp Gly Arg Lys Ser Phe Leu Met Lys Arg 1385 1390 1395 Gly Asp Val Ile Asp Tyr Ser Ser Ser Gly Val Ser Thr Asn Glu 1400 1405 1410 Ala Ser Pro Leu Asp Pro Ile Thr Glu Glu Asp Glu Lys Ser Asp 1415 1420 1425 Gln Ser Gly Ser Lys Leu Leu Pro Gly Lys Lys Ser Ser Glu Arg 1430 1435 1440 Pro Ser Leu Phe Gln Thr Asp Leu Lys Leu Lys Gly Gly Gly Leu 1445 1450 1455 Arg Tyr Gln Lys Leu Pro Ser Asp Glu Asp Glu Ser Gly Thr Gly 1460 1465 1470 Arg Val Gln Ile Thr Pro His Cys Ser Lys Met Ile Arg Thr Lys 1475 1480 1485 Arg Leu Lys Ala Lys Gln Arg Glu Cys Ala Ser Pro Gln Glu His 1490 1495 1500 Ser Ala Glu Pro Ile Arg Thr Phe Ile Lys Ala Lys Glu Tyr Leu 1505 1510 1515 Ser Asp Ala Leu Leu Asp Lys Lys Asp Ser Ser Asp Ser Gly Val 1520 1525 1530 Arg Ser Asn Glu Ser Ser Pro Asn His Ser Leu His Asn Glu Ala 1535 1540 1545 Ala Asp Asp Ser Gln Leu Glu Lys Ala Asn Leu Ile Glu Leu Glu 1550 1555 1560 Asp Glu Gly His Ser Gly Lys Arg Gly Met Pro His Ser Leu Ser 1565 1570 1575 Gly Leu Gln Asp Pro Ile Ile Ala Arg Met Ser Ile Cys Ser Glu 1580 1585 1590 Asp Lys Lys Ser Pro Ser Glu Cys Ser Leu Ile Ala Ser Ser Pro 1595 1600 1605 Glu Glu Ser Trp Pro Ala Cys Gln Lys Ala Tyr Asn Leu Asn Arg 1610 1615 1620 Thr Pro Ser Thr Val Thr Leu Asn Asn Asn Thr Ala Pro Thr Asn 1625 1630 1635 Arg Ala Asn Gln Asn Phe Asp Glu Ile Glu Gly Ile Arg Glu Thr 1640 1645 1650 Ser Gln Val Ile Leu Arg Pro Gly Pro Ser Pro Asn Pro Thr Ala 1655 1660 1665 Val Gln Asn Glu Asn Leu Lys Ser Met Ala His Lys Arg Ser Gln 1670 1675 1680 Arg Ser Ser Tyr Thr Arg Leu Ser Lys Asp Ala Ser Glu Leu His 1685 1690 1695 Ala Ala Ser Ser Glu Ser Thr Gly Phe Gly Glu Glu Arg Glu Ser 1700 1705 1710 Ile Leu 1715 3 5305 DNA Homo sapiens CDS (158)..(5305) 3 ggaagaatat ggccgccggg tgtggtgagg gcgacgcgct tgcagtcgcc gtctcttgct 60 tccccgtcct ctgacatcgc ctgcagccga gcgggcccgt tccgccggag ctgaggacca 120 ggtattcaaa taaagttaat tgcagctttc tgtgaaa atg tca gtt ttg ata tca 175 Met Ser Val Leu Ile Ser 1 5 cag agc gtc ata aat tat gta gag gaa gaa aac att cct gct ctg aaa 223 Gln Ser Val Ile Asn Tyr Val Glu Glu Glu Asn Ile Pro Ala Leu Lys 10 15 20 gct ctt ctt gaa aaa tgc aaa gat gta gat gag aga aat gag tgt ggc 271 Ala Leu Leu Glu Lys Cys Lys Asp Val Asp Glu Arg Asn Glu Cys Gly 25 30 35 cag act cca ctg atg ata gct gcc gaa caa ggc aat ctg gaa ata gtg 319 Gln Thr Pro Leu Met Ile Ala Ala Glu Gln Gly Asn Leu Glu Ile Val 40 45 50 aag gaa tta att aag aat gga gct aac tgc aat ctg gaa gat ttg gat 367 Lys Glu Leu Ile Lys Asn Gly Ala Asn Cys Asn Leu Glu Asp Leu Asp 55 60 65 70 aat tgg aca gca ctt ata tct gca tcg aaa gaa ggg cat gtg cac atc 415 Asn Trp Thr Ala Leu Ile Ser Ala Ser Lys Glu Gly His Val His Ile 75 80 85 gta gag gaa cta ctg aaa tgt ggg gtt aac ttg gag cac cgt gat atg 463 Val Glu Glu Leu Leu Lys Cys Gly Val Asn Leu Glu His Arg Asp Met 90 95 100 gga gga tgg aca gct ctt atg tgg gca tgt tac aaa ggc cgt act gac 511 Gly Gly Trp Thr Ala Leu Met Trp Ala Cys Tyr Lys Gly Arg Thr Asp 105 110 115 gta gta gag ttg ctt ctt tct cat ggt gcc aat cca agt gtc act ggt 559 Val Val Glu Leu Leu Leu Ser His Gly Ala Asn Pro Ser Val Thr Gly 120 125 130 ctg cag tac agt gtt tac cca atc att tgg gca gca ggg aga ggc cat 607 Leu Gln Tyr Ser Val Tyr Pro Ile Ile Trp Ala Ala Gly Arg Gly His 135 140 145 150 gca gat ata gtt cat ctt tta ctg caa aat ggt gct aaa gtc aac tgc 655 Ala Asp Ile Val His Leu Leu Leu Gln Asn Gly Ala Lys Val Asn Cys 155 160 165 tct gat aag tat gga acc acc cct tta gtt tgg gct gca cga aag ggt 703 Ser Asp Lys Tyr Gly Thr Thr Pro Leu Val Trp Ala Ala Arg Lys Gly 170 175 180 cat ttg gaa tgt gtg aaa cat tta ttg gcc atg gga gct gat gtg gat 751 His Leu Glu Cys Val Lys His Leu Leu Ala Met Gly Ala Asp Val Asp 185 190 195 caa gaa gga gct aat tca atg act gca ctt att gtg gca gtg aaa gga 799 Gln Glu Gly Ala Asn Ser Met Thr Ala Leu Ile Val Ala Val Lys Gly 200 205 210 ggt tac aca cag tca gta aaa gaa att ttg aag agg aat cca aat gta 847 Gly Tyr Thr Gln Ser Val Lys Glu Ile Leu Lys Arg Asn Pro Asn Val 215 220 225 230 aac tta aca gat aaa gat gga aat aca gct ttg atg att gca tca aag 895 Asn Leu Thr Asp Lys Asp Gly Asn Thr Ala Leu Met Ile Ala Ser Lys 235 240 245 gag gga cat acg gag att gtg cag gat ctg ctc gac gct gga aca tat 943 Glu Gly His Thr Glu Ile Val Gln Asp Leu Leu Asp Ala Gly Thr Tyr 250 255 260 gtg aac ata cct gac agg agt ggg gat act gtg ttg att ggc gct gtc 991 Val Asn Ile Pro Asp Arg Ser Gly Asp Thr Val Leu Ile Gly Ala Val 265 270 275 aga ggt ggt cat gtt gaa att gtt cga gcg ctt ctc caa aaa tat gct 1039 Arg Gly Gly His Val Glu Ile Val Arg Ala Leu Leu Gln Lys Tyr Ala 280 285 290 gat ata gac att aga gga cag gat aat aaa act gct ttg tat tgg gct 1087 Asp Ile Asp Ile Arg Gly Gln Asp Asn Lys Thr Ala Leu Tyr Trp Ala 295 300 305 310 gtt gag aaa gga aat gca aca atg gtg aga gat atc tta cag tgc aat 1135 Val Glu Lys Gly Asn Ala Thr Met Val Arg Asp Ile Leu Gln Cys Asn 315 320 325 cct gac act gaa ata tgc aca aag gat ggt gaa acg cca ctt ata aag 1183 Pro Asp Thr Glu Ile Cys Thr Lys Asp Gly Glu Thr Pro Leu Ile Lys 330 335 340 gct acc aag atg aga aac att gaa gtg gtg gag ctg ctg cta gat aaa 1231 Ala Thr Lys Met Arg Asn Ile Glu Val Val Glu Leu Leu Leu Asp Lys 345 350 355 ggt gct aaa gtg tct gct gta gat aag aaa gga gat act ccc ttg cat 1279 Gly Ala Lys Val Ser Ala Val Asp Lys Lys Gly Asp Thr Pro Leu His 360 365 370 att gct att cgt gga agg agc cgg aaa ctg gca gaa ctg ctt tta aga 1327 Ile Ala Ile Arg Gly Arg Ser Arg Lys Leu Ala Glu Leu Leu Leu Arg 375 380 385 390 aat ccc aaa gat ggg cga tta ctt tat agg ccc aac aaa gca ggc gag 1375 Asn Pro Lys Asp Gly Arg Leu Leu Tyr Arg Pro Asn Lys Ala Gly Glu 395 400 405 act cct tat aat att gac tgt agc cat cag aag agt att tta act caa 1423 Thr Pro Tyr Asn Ile Asp Cys Ser His Gln Lys Ser Ile Leu Thr Gln 410 415 420 ata ttt gga gcc aga cac ttg tct cct act gaa aca gac ggt gac atg 1471 Ile Phe Gly Ala Arg His Leu Ser Pro Thr Glu Thr Asp Gly Asp Met 425 430 435 ctt gga tat gat tta tat agc agt gcc ctg gca gat att ctc agt gag 1519 Leu Gly Tyr Asp Leu Tyr Ser Ser Ala Leu Ala Asp Ile Leu Ser Glu 440 445 450 cct acc atg cag cca ccc att tgt gtg ggg tta tat gca cag tgg gga 1567 Pro Thr Met Gln Pro Pro Ile Cys Val Gly Leu Tyr Ala Gln Trp Gly 455 460 465 470 agt ggg aaa tct ttc tta ctc aag aaa cta gaa gac gaa atg aaa acc 1615 Ser Gly Lys Ser Phe Leu Leu Lys Lys Leu Glu Asp Glu Met Lys Thr 475 480 485 ttc gcc gga caa cag att gag cct ctc ttt cag ttc tca tgg ctc ata 1663 Phe Ala Gly Gln Gln Ile Glu Pro Leu Phe Gln Phe Ser Trp Leu Ile 490 495 500 gtg ttt ctt acc ctg cta ctt tgt gga ggg ctt ggt tta ttg ttt gcc 1711 Val Phe Leu Thr Leu Leu Leu Cys Gly Gly Leu Gly Leu Leu Phe Ala 505 510 515 ttc acg gtc cac cca aat ctt gga ata gca gtg tca ctg agc ttc ttg 1759 Phe Thr Val His Pro Asn Leu Gly Ile Ala Val Ser Leu Ser Phe Leu 520 525 530 gct ctc tta tat ata ttc ttt att gtc att tac ttt ggt gga cga aga 1807 Ala Leu Leu Tyr Ile Phe Phe Ile Val Ile Tyr Phe Gly Gly Arg Arg 535 540 545 550 gaa gga gag agt tgg aat tgg gcc tgg gtc ctc agc act aga ttg gca 1855 Glu Gly Glu Ser Trp Asn Trp Ala Trp Val Leu Ser Thr Arg Leu Ala 555 560 565 aga cat att gga tat ttg gaa ctc ctc ctt aaa ttg atg ttt gtg aat 1903 Arg His Ile Gly Tyr Leu Glu Leu Leu Leu Lys Leu Met Phe Val Asn 570 575 580 cca cct gag ttg cca gag cag act act aaa gct tta cct gtg agg ttt 1951 Pro Pro Glu Leu Pro Glu Gln Thr Thr Lys Ala Leu Pro Val Arg Phe 585 590 595 ttg ttt aca gat tac aat aga ctg tcc agt gta ggt gga gaa act tct 1999 Leu Phe Thr Asp Tyr Asn Arg Leu Ser Ser Val Gly Gly Glu Thr Ser 600 605 610 ctg gct gaa atg att gca acc ctc tcg gat gct tgt gaa aga gag ttt 2047 Leu Ala Glu Met Ile Ala Thr Leu Ser Asp Ala Cys Glu Arg Glu Phe 615 620 625 630 ggc ttt ttg gca acc agg ctt ttt cga gta ttc aag act gaa gat act 2095 Gly Phe Leu Ala Thr Arg Leu Phe Arg Val Phe Lys Thr Glu Asp Thr 635 640 645 cag ggt aaa aag aaa tgg aaa aaa aca tgt tgt ctc cca tct ttt gtc 2143 Gln Gly Lys Lys Lys Trp Lys Lys Thr Cys Cys Leu Pro Ser Phe Val 650 655 660 atc ttc ctt ttt atc att ggc tgc att ata tct gga att act ctt ctg 2191 Ile Phe Leu Phe Ile Ile Gly Cys Ile Ile Ser Gly Ile Thr Leu Leu 665 670 675 gct ata ttt aga gtt gac cca aag cat ctg act gta aat gct gtc ctc 2239 Ala Ile Phe Arg Val Asp Pro Lys His Leu Thr Val Asn Ala Val Leu 680 685 690 ata tca atc gca tct gta gtg gga ttg gcc ttt gtg ttg aac tgt cgt 2287 Ile Ser Ile Ala Ser Val Val Gly Leu Ala Phe Val Leu Asn Cys Arg 695 700 705 710 aca tgg tgg caa gtg ctg gac tcg ctc ctg aat tcc caa aga aaa cgc 2335 Thr Trp Trp Gln Val Leu Asp Ser Leu Leu Asn Ser Gln Arg Lys Arg 715 720 725 ctc cat aat gca gcc tcc aaa ctg cac aaa ttg aaa agt gaa gga ttc 2383 Leu His Asn Ala Ala Ser Lys Leu His Lys Leu Lys Ser Glu Gly Phe 730 735 740 atg aaa gtt ctt aaa tgt gaa gtg gaa ttg atg gcc agg atg gca aaa 2431 Met Lys Val Leu Lys Cys Glu Val Glu Leu Met Ala Arg Met Ala Lys 745 750 755 acc att gac agc ttc act cag aat cag aca agg ctg gtg gtc atc atc 2479 Thr Ile Asp Ser Phe Thr Gln Asn Gln Thr Arg Leu Val Val Ile Ile 760 765 770 gat gga tta gat gcc tgt gag cag gac aaa gtc ctt cag atg ctg gac 2527 Asp Gly Leu Asp Ala Cys Glu Gln Asp Lys Val Leu Gln Met Leu Asp 775 780 785 790 act gtc cga gtt ctg ttt tca aaa ggc ccg ttc att gcc att ttt gca 2575 Thr Val Arg Val Leu Phe Ser Lys Gly Pro Phe Ile Ala Ile Phe Ala 795 800 805 agt gat cca cat att atc ata aag gca att aac cag aac ctc aat agt 2623 Ser Asp Pro His Ile Ile Ile Lys Ala Ile Asn Gln Asn Leu Asn Ser 810 815 820 gtg ctt cgg gat tca aat ata aat ggc cat gac tac atg cgc aac ata 2671 Val Leu Arg Asp Ser Asn Ile Asn Gly His Asp Tyr Met Arg Asn Ile 825 830 835 gtc cac ttg cct gtg ttc ctt aat agt cgt gga cta agc aat gca aga 2719 Val His Leu Pro Val Phe Leu Asn Ser Arg Gly Leu Ser Asn Ala Arg 840 845 850 aaa ttt ctc gta act tca gca aca aat gga gac gtt cca tgc tca gat 2767 Lys Phe Leu Val Thr Ser Ala Thr Asn Gly Asp Val Pro Cys Ser Asp 855 860 865 870 act aca ggg ata cag gaa gat gct gac aga aga gtt tca cag aac agc 2815 Thr Thr Gly Ile Gln Glu Asp Ala Asp Arg Arg Val Ser Gln Asn Ser 875 880 885 ctt ggg gag atg aca aaa ctt ggt agc aag aca gcc ctc aat aga cgg 2863 Leu Gly Glu Met Thr Lys Leu Gly Ser Lys Thr Ala Leu Asn Arg Arg 890 895 900 gac act tac cga aga agg cag atg cag agg acc atc act cgc cag atg 2911 Asp Thr Tyr Arg Arg Arg Gln Met Gln Arg Thr Ile Thr Arg Gln Met 905 910 915 tcc ttt gat ctt aca aaa ctg ctg gtt acc gag gac tgg ttc agt gac 2959 Ser Phe Asp Leu Thr Lys Leu Leu Val Thr Glu Asp Trp Phe Ser Asp 920 925 930 atc agt ccc cag acc atg aga aga tta ctt aat att gtt tct gtg aca 3007 Ile Ser Pro Gln Thr Met Arg Arg Leu Leu Asn Ile Val Ser Val Thr 935 940 945 950 gga cga tta ctg aga gcc aat cag att agt ttc aac tgg gac agg ctt 3055 Gly Arg Leu Leu Arg Ala Asn Gln Ile Ser Phe Asn Trp Asp Arg Leu 955 960 965 gct agc tgg atc aac ctt act gag cag tgg cca tac cgg act tca tgg 3103 Ala Ser Trp Ile Asn Leu Thr Glu Gln Trp Pro Tyr Arg Thr Ser Trp 970 975 980 ctc ata tta tat ttg gaa gag act gaa ggt att cca gat caa atg aca 3151 Leu Ile Leu Tyr Leu Glu Glu Thr Glu Gly Ile Pro Asp Gln Met Thr 985 990 995 tta aaa acc atc tac gaa aga ata tca aag aat att cca aca act 3196 Leu Lys Thr Ile Tyr Glu Arg Ile Ser Lys Asn Ile Pro Thr Thr 1000 1005 1010 aag gat gtt gag cca ctt ctt gaa att gat gga gat ata aga aat 3241 Lys Asp Val Glu Pro Leu Leu Glu Ile Asp Gly Asp Ile Arg Asn 1015 1020 1025 ttt gaa gtg ttt ttg tct tca agg acc cca gtt ctt gtg gct cga 3286 Phe Glu Val Phe Leu Ser Ser Arg Thr Pro Val Leu Val Ala Arg 1030 1035 1040 gat gta aaa gtc ttt ttg cca tgc act gta aac cta gat ccc aaa 3331 Asp Val Lys Val Phe Leu Pro Cys Thr Val Asn Leu Asp Pro Lys 1045 1050 1055 cta cgg gaa att att gca gat gtt cgt gct gcc aga gag cag atc 3376 Leu Arg Glu Ile Ile Ala Asp Val Arg Ala Ala Arg Glu Gln Ile 1060 1065 1070 agt att gga gga ctg gcg tac ccc ccg ctc cct cta cat gag ggt 3421 Ser Ile Gly Gly Leu Ala Tyr Pro Pro Leu Pro Leu His Glu Gly 1075 1080 1085 cct cct agg gcg cca tca ggg tac agc cag ccc cca tcc gtg tgc 3466 Pro Pro Arg Ala Pro Ser Gly Tyr Ser Gln Pro Pro Ser Val Cys 1090 1095 1100 tct tcc acg tcc ttc aat ggg ccc ttc gca ggt gga gtg gtg tca 3511 Ser Ser Thr Ser Phe Asn Gly Pro Phe Ala Gly Gly Val Val Ser 1105 1110 1115 cca cag cct cac agc agc tat tac agc ggc atg acg ggc cct cag 3556 Pro Gln Pro His Ser Ser Tyr Tyr Ser Gly Met Thr Gly Pro Gln 1120 1125 1130 cat ccc ttc tac aac agg ggg tca ggc cca gcc cca ggc cca gtg 3601 His Pro Phe Tyr Asn Arg Gly Ser Gly Pro Ala Pro Gly Pro Val 1135 1140 1145 gta tta ctg aat tca ctg aat gtg gat gca gta tgt gag aag ctg 3646 Val Leu Leu Asn Ser Leu Asn Val Asp Ala Val Cys Glu Lys Leu 1150 1155 1160 aaa caa ata gaa ggg ctg gac cag agt atg ctg cct cag tat tgt 3691 Lys Gln Ile Glu Gly Leu Asp Gln Ser Met Leu Pro Gln Tyr Cys 1165 1170 1175 acc acg atc aaa aag gca aac ata aat ggc cgt gtg tta gct cag 3736 Thr Thr Ile Lys Lys Ala Asn Ile Asn Gly Arg Val Leu Ala Gln 1180 1185 1190 tgt aac att gat gag ctg aag aaa gag atg aat atg aat ttt gga 3781 Cys Asn Ile Asp Glu Leu Lys Lys Glu Met Asn Met Asn Phe Gly 1195 1200 1205 gac tgg cac ctt ttc aga agc aca gta cta gaa atg aga aac gca 3826 Asp Trp His Leu Phe Arg Ser Thr Val Leu Glu Met Arg Asn Ala 1210 1215 1220 gaa agc cac gtg gtc cct gaa gac cca cgt ttc ctc agt gag agc 3871 Glu Ser His Val Val Pro Glu Asp Pro Arg Phe Leu Ser Glu Ser 1225 1230 1235 agc agt ggc cca gcc ccg cac ggt gag cct gct cgc cgc gct tcc 3916 Ser Ser Gly Pro Ala Pro His Gly Glu Pro Ala Arg Arg Ala Ser 1240 1245 1250 cac aac gag ctg cct cac acc gag ctc tcc agc cag acg ccc tac 3961 His Asn Glu Leu Pro His Thr Glu Leu Ser Ser Gln Thr Pro Tyr 1255 1260 1265 aca ctc aac ttc agc ttc gaa gag ctg aac acg ctt ggc ctg gat 4006 Thr Leu Asn Phe Ser Phe Glu Glu Leu Asn Thr Leu Gly Leu Asp 1270 1275 1280 gaa ggt gcc cct cgt cac agt aat cta agt tgg cag tca caa act 4051 Glu Gly Ala Pro Arg His Ser Asn Leu Ser Trp Gln Ser Gln Thr 1285 1290 1295 cgc aga acc cca agt ctt tcg agt ctc aat tcc cag gat tcc agt 4096 Arg Arg Thr Pro Ser Leu Ser Ser Leu Asn Ser Gln Asp Ser Ser 1300 1305 1310 att gaa att tca aag ctt act gat aag gtg cag gcc gag tat aga 4141 Ile Glu Ile Ser Lys Leu Thr Asp Lys Val Gln Ala Glu Tyr Arg 1315 1320 1325 gat gcc tat aga gaa tac att gct cag atg tcc cag tta gaa ggg 4186 Asp Ala Tyr Arg Glu Tyr Ile Ala Gln Met Ser Gln Leu Glu Gly 1330 1335 1340 ggc ccc ggg tct aca acc att agt ggc aga tct tct cca cat agc 4231 Gly Pro Gly Ser Thr Thr Ile Ser Gly Arg Ser Ser Pro His Ser 1345 1350 1355 aca tat tac atg ggt cag agt tca tca ggg ggc tct att cat tca 4276 Thr Tyr Tyr Met Gly Gln Ser Ser Ser Gly Gly Ser Ile His Ser 1360 1365 1370 aac cta gag caa gaa aag ggg aag gat agt gaa cca aag ccc gat 4321 Asn Leu Glu Gln Glu Lys Gly Lys Asp Ser Glu Pro Lys Pro Asp 1375 1380 1385 gat ggg agg aag tcc ttt cta atg aag agg gga gat gtt atc gat 4366 Asp Gly Arg Lys Ser Phe Leu Met Lys Arg Gly Asp Val Ile Asp 1390 1395 1400 tat tca tca tca ggg gtt tcc acc aac gat gct tcc ccc ctg gat 4411 Tyr Ser Ser Ser Gly Val Ser Thr Asn Asp Ala Ser Pro Leu Asp 1405 1410 1415 cct atc act gaa gaa gat gaa aaa tca gat cag tca ggc agt aag 4456 Pro Ile Thr Glu Glu Asp Glu Lys Ser Asp Gln Ser Gly Ser Lys 1420 1425 1430 ctt ctc cca ggc aag aaa tct tcc gaa agg tca agc ctc ttc cag 4501 Leu Leu Pro Gly Lys Lys Ser Ser Glu Arg Ser Ser Leu Phe Gln 1435 1440 1445 aca gat ttg aag ctt aag gga agt ggg ctg cgc tat caa aaa ctc 4546 Thr Asp Leu Lys Leu Lys Gly Ser Gly Leu Arg Tyr Gln Lys Leu 1450 1455 1460 cca agt gac gag gat gaa tct ggc aca gaa gaa tca gat aac act 4591 Pro Ser Asp Glu Asp Glu Ser Gly Thr Glu Glu Ser Asp Asn Thr 1465 1470 1475 cca ctg ctc aaa gat gac aaa gac aga aaa gcc gaa ggg aaa gta 4636 Pro Leu Leu Lys Asp Asp Lys Asp Arg Lys Ala Glu Gly Lys Val 1480 1485 1490 gag aga gtg ccg aag tct cca gaa cac agt gct gag ccg atc aga 4681 Glu Arg Val Pro Lys Ser Pro Glu His Ser Ala Glu Pro Ile Arg 1495 1500 1505 acc ttc att aaa gcc aaa gag tat tta tcg gat gcg ctc ctt gac 4726 Thr Phe Ile Lys Ala Lys Glu Tyr Leu Ser Asp Ala Leu Leu Asp 1510 1515 1520 aaa aag gat tca tcg gat tca gga gtg aga tcc agt gaa agt tct 4771 Lys Lys Asp Ser Ser Asp Ser Gly Val Arg Ser Ser Glu Ser Ser 1525 1530 1535 ccc aat cac tct ctg cac aat gaa gtg gcg gat gac tcc cag ctt 4816 Pro Asn His Ser Leu His Asn Glu Val Ala Asp Asp Ser Gln Leu 1540 1545 1550 gaa aag gca aat ctc ata gag ctg gaa gat gac agt cac agc gga 4861 Glu Lys Ala Asn Leu Ile Glu Leu Glu Asp Asp Ser His Ser Gly 1555 1560 1565 aag cgg gga atc cca cat agc ctg agt ggc ctg caa gat cca att 4906 Lys Arg Gly Ile Pro His Ser Leu Ser Gly Leu Gln Asp Pro Ile 1570 1575 1580 ata gct cgg atg tcc att tgt tca gaa gac aag aaa agc cct tcc 4951 Ile Ala Arg Met Ser Ile Cys Ser Glu Asp Lys Lys Ser Pro Ser 1585 1590 1595 gaa tgc agc ttg ata gcc agc agc cct gaa gaa aac tgg cct gca 4996 Glu Cys Ser Leu Ile Ala Ser Ser Pro Glu Glu Asn Trp Pro Ala 1600 1605 1610 tgc cag aaa gcc tac aac ctg aac cga act ccc agc acc gtg act 5041 Cys Gln Lys Ala Tyr Asn Leu Asn Arg Thr Pro Ser Thr Val Thr 1615 1620 1625 ctg aac aac aat agt gct cca gcc aac aga gcc aat caa aat ttc 5086 Leu Asn Asn Asn Ser Ala Pro Ala Asn Arg Ala Asn Gln Asn Phe 1630 1635 1640 gat gag atg gag gga att agg gag act tct caa gtc att ttg agg 5131 Asp Glu Met Glu Gly Ile Arg Glu Thr Ser Gln Val Ile Leu Arg 1645 1650 1655 cct agt tcc agt ccc aac cca acc act att cag aat gag aat cta 5176 Pro Ser Ser Ser Pro Asn Pro Thr Thr Ile Gln Asn Glu Asn Leu 1660 1665 1670 aaa agc atg aca cat aag cga agc caa cgt tca agt tac aca agg 5221 Lys Ser Met Thr His Lys Arg Ser Gln Arg Ser Ser Tyr Thr Arg 1675 1680 1685 ctc tcc aaa gat cct ccg gag ctc cat gca gca gcc tct tct gag 5266 Leu Ser Lys Asp Pro Pro Glu Leu His Ala Ala Ala Ser Ser Glu 1690 1695 1700 agc aca ggc ttt gga gaa gaa aga gaa agc att ctt tga 5305 Ser Thr Gly Phe Gly Glu Glu Arg Glu Ser Ile Leu 1705 1710 1715 4 1715 PRT Homo sapiens 4 Met Ser Val Leu Ile Ser Gln Ser Val Ile Asn Tyr Val Glu Glu Glu 1 5 10 15 Asn Ile Pro Ala Leu Lys Ala Leu Leu Glu Lys Cys Lys Asp Val Asp 20 25 30 Glu Arg Asn Glu Cys Gly Gln Thr Pro Leu Met Ile Ala Ala Glu Gln 35 40 45 Gly Asn Leu Glu Ile Val Lys Glu Leu Ile Lys Asn Gly Ala Asn Cys 50 55 60 Asn Leu Glu Asp Leu Asp Asn Trp Thr Ala Leu Ile Ser Ala Ser Lys 65 70 75 80 Glu Gly His Val His Ile Val Glu Glu Leu Leu Lys Cys Gly Val Asn 85 90 95 Leu Glu His Arg Asp Met Gly Gly Trp Thr Ala Leu Met Trp Ala Cys 100 105 110 Tyr Lys Gly Arg Thr Asp Val Val Glu Leu Leu Leu Ser His Gly Ala 115 120 125 Asn Pro Ser Val Thr Gly Leu Gln Tyr Ser Val Tyr Pro Ile Ile Trp 130 135 140 Ala Ala Gly Arg Gly His Ala Asp Ile Val His Leu Leu Leu Gln Asn 145 150 155 160 Gly Ala Lys Val Asn Cys Ser Asp Lys Tyr Gly Thr Thr Pro Leu Val 165 170 175 Trp Ala Ala Arg Lys Gly His Leu Glu Cys Val Lys His Leu Leu Ala 180 185 190 Met Gly Ala Asp Val Asp Gln Glu Gly Ala Asn Ser Met Thr Ala Leu 195 200 205 Ile Val Ala Val Lys Gly Gly Tyr Thr Gln Ser Val Lys Glu Ile Leu 210 215 220 Lys Arg Asn Pro Asn Val Asn Leu Thr Asp Lys Asp Gly Asn Thr Ala 225 230 235 240 Leu Met Ile Ala Ser Lys Glu Gly His Thr Glu Ile Val Gln Asp Leu 245 250 255 Leu Asp Ala Gly Thr Tyr Val Asn Ile Pro Asp Arg Ser Gly Asp Thr 260 265 270 Val Leu Ile Gly Ala Val Arg Gly Gly His Val Glu Ile Val Arg Ala 275 280 285 Leu Leu Gln Lys Tyr Ala Asp Ile Asp Ile Arg Gly Gln Asp Asn Lys 290 295 300 Thr Ala Leu Tyr Trp Ala Val Glu Lys Gly Asn Ala Thr Met Val Arg 305 310 315 320 Asp Ile Leu Gln Cys Asn Pro Asp Thr Glu Ile Cys Thr Lys Asp Gly 325 330 335 Glu Thr Pro Leu Ile Lys Ala Thr Lys Met Arg Asn Ile Glu Val Val 340 345 350 Glu Leu Leu Leu Asp Lys Gly Ala Lys Val Ser Ala Val Asp Lys Lys 355 360 365 Gly Asp Thr Pro Leu His Ile Ala Ile Arg Gly Arg Ser Arg Lys Leu 370 375 380 Ala Glu Leu Leu Leu Arg Asn Pro Lys Asp Gly Arg Leu Leu Tyr Arg 385 390 395 400 Pro Asn Lys Ala Gly Glu Thr Pro Tyr Asn Ile Asp Cys Ser His Gln 405 410 415 Lys Ser Ile Leu Thr Gln Ile Phe Gly Ala Arg His Leu Ser Pro Thr 420 425 430 Glu Thr Asp Gly Asp Met Leu Gly Tyr Asp Leu Tyr Ser Ser Ala Leu 435 440 445 Ala Asp Ile Leu Ser Glu Pro Thr Met Gln Pro Pro Ile Cys Val Gly 450 455 460 Leu Tyr Ala Gln Trp Gly Ser Gly Lys Ser Phe Leu Leu Lys Lys Leu 465 470 475 480 Glu Asp Glu Met Lys Thr Phe Ala Gly Gln Gln Ile Glu Pro Leu Phe 485 490 495 Gln Phe Ser Trp Leu Ile Val Phe Leu Thr Leu Leu Leu Cys Gly Gly 500 505 510 Leu Gly Leu Leu Phe Ala Phe Thr Val His Pro Asn Leu Gly Ile Ala 515 520 525 Val Ser Leu Ser Phe Leu Ala Leu Leu Tyr Ile Phe Phe Ile Val Ile 530 535 540 Tyr Phe Gly Gly Arg Arg Glu Gly Glu Ser Trp Asn Trp Ala Trp Val 545 550 555 560 Leu Ser Thr Arg Leu Ala Arg His Ile Gly Tyr Leu Glu Leu Leu Leu 565 570 575 Lys Leu Met Phe Val Asn Pro Pro Glu Leu Pro Glu Gln Thr Thr Lys 580 585 590 Ala Leu Pro Val Arg Phe Leu Phe Thr Asp Tyr Asn Arg Leu Ser Ser 595 600 605 Val Gly Gly Glu Thr Ser Leu Ala Glu Met Ile Ala Thr Leu Ser Asp 610 615 620 Ala Cys Glu Arg Glu Phe Gly Phe Leu Ala Thr Arg Leu Phe Arg Val 625 630 635 640 Phe Lys Thr Glu Asp Thr Gln Gly Lys Lys Lys Trp Lys Lys Thr Cys 645 650 655 Cys Leu Pro Ser Phe Val Ile Phe Leu Phe Ile Ile Gly Cys Ile Ile 660 665 670 Ser Gly Ile Thr Leu Leu Ala Ile Phe Arg Val Asp Pro Lys His Leu 675 680 685 Thr Val Asn Ala Val Leu Ile Ser Ile Ala Ser Val Val Gly Leu Ala 690 695 700 Phe Val Leu Asn Cys Arg Thr Trp Trp Gln Val Leu Asp Ser Leu Leu 705 710 715 720 Asn Ser Gln Arg Lys Arg Leu His Asn Ala Ala Ser Lys Leu His Lys 725 730 735 Leu Lys Ser Glu Gly Phe Met Lys Val Leu Lys Cys Glu Val Glu Leu 740 745 750 Met Ala Arg Met Ala Lys Thr Ile Asp Ser Phe Thr Gln Asn Gln Thr 755 760 765 Arg Leu Val Val Ile Ile Asp Gly Leu Asp Ala Cys Glu Gln Asp Lys 770 775 780 Val Leu Gln Met Leu Asp Thr Val Arg Val Leu Phe Ser Lys Gly Pro 785 790 795 800 Phe Ile Ala Ile Phe Ala Ser Asp Pro His Ile Ile Ile Lys Ala Ile 805 810 815 Asn Gln Asn Leu Asn Ser Val Leu Arg Asp Ser Asn Ile Asn Gly His 820 825 830 Asp Tyr Met Arg Asn Ile Val His Leu Pro Val Phe Leu Asn Ser Arg 835 840 845 Gly Leu Ser Asn Ala Arg Lys Phe Leu Val Thr Ser Ala Thr Asn Gly 850 855 860 Asp Val Pro Cys Ser Asp Thr Thr Gly Ile Gln Glu Asp Ala Asp Arg 865 870 875 880 Arg Val Ser Gln Asn Ser Leu Gly Glu Met Thr Lys Leu Gly Ser Lys 885 890 895 Thr Ala Leu Asn Arg Arg Asp Thr Tyr Arg Arg Arg Gln Met Gln Arg 900 905 910 Thr Ile Thr Arg Gln Met Ser Phe Asp Leu Thr Lys Leu Leu Val Thr 915 920 925 Glu Asp Trp Phe Ser Asp Ile Ser Pro Gln Thr Met Arg Arg Leu Leu 930 935 940 Asn Ile Val Ser Val Thr Gly Arg Leu Leu Arg Ala Asn Gln Ile Ser 945 950 955 960 Phe Asn Trp Asp Arg Leu Ala Ser Trp Ile Asn Leu Thr Glu Gln Trp 965 970 975 Pro Tyr Arg Thr Ser Trp Leu Ile Leu Tyr Leu Glu Glu Thr Glu Gly 980 985 990 Ile Pro Asp Gln Met Thr Leu Lys Thr Ile Tyr Glu Arg Ile Ser Lys 995 1000 1005 Asn Ile Pro Thr Thr Lys Asp Val Glu Pro Leu Leu Glu Ile Asp 1010 1015 1020 Gly Asp Ile Arg Asn Phe Glu Val Phe Leu Ser Ser Arg Thr Pro 1025 1030 1035 Val Leu Val Ala Arg Asp Val Lys Val Phe Leu Pro Cys Thr Val 1040 1045 1050 Asn Leu Asp Pro Lys Leu Arg Glu Ile Ile Ala Asp Val Arg Ala 1055 1060 1065 Ala Arg Glu Gln Ile Ser Ile Gly Gly Leu Ala Tyr Pro Pro Leu 1070 1075 1080 Pro Leu His Glu Gly Pro Pro Arg Ala Pro Ser Gly Tyr Ser Gln 1085 1090 1095 Pro Pro Ser Val Cys Ser Ser Thr Ser Phe Asn Gly Pro Phe Ala 1100 1105 1110 Gly Gly Val Val Ser Pro Gln Pro His Ser Ser Tyr Tyr Ser Gly 1115 1120 1125 Met Thr Gly Pro Gln His Pro Phe Tyr Asn Arg Gly Ser Gly Pro 1130 1135 1140 Ala Pro Gly Pro Val Val Leu Leu Asn Ser Leu Asn Val Asp Ala 1145 1150 1155 Val Cys Glu Lys Leu Lys Gln Ile Glu Gly Leu Asp Gln Ser Met 1160 1165 1170 Leu Pro Gln Tyr Cys Thr Thr Ile Lys Lys Ala Asn Ile Asn Gly 1175 1180 1185 Arg Val Leu Ala Gln Cys Asn Ile Asp Glu Leu Lys Lys Glu Met 1190 1195 1200 Asn Met Asn Phe Gly Asp Trp His Leu Phe Arg Ser Thr Val Leu 1205 1210 1215 Glu Met Arg Asn Ala Glu Ser His Val Val Pro Glu Asp Pro Arg 1220 1225 1230 Phe Leu Ser Glu Ser Ser Ser Gly Pro Ala Pro His Gly Glu Pro 1235 1240 1245 Ala Arg Arg Ala Ser His Asn Glu Leu Pro His Thr Glu Leu Ser 1250 1255 1260 Ser Gln Thr Pro Tyr Thr Leu Asn Phe Ser Phe Glu Glu Leu Asn 1265 1270 1275 Thr Leu Gly Leu Asp Glu Gly Ala Pro Arg His Ser Asn Leu Ser 1280 1285 1290 Trp Gln Ser Gln Thr Arg Arg Thr Pro Ser Leu Ser Ser Leu Asn 1295 1300 1305 Ser Gln Asp Ser Ser Ile Glu Ile Ser Lys Leu Thr Asp Lys Val 1310 1315 1320 Gln Ala Glu Tyr Arg Asp Ala Tyr Arg Glu Tyr Ile Ala Gln Met 1325 1330 1335 Ser Gln Leu Glu Gly Gly Pro Gly Ser Thr Thr Ile Ser Gly Arg 1340 1345 1350 Ser Ser Pro His Ser Thr Tyr Tyr Met Gly Gln Ser Ser Ser Gly 1355 1360 1365 Gly Ser Ile His Ser Asn Leu Glu Gln Glu Lys Gly Lys Asp Ser 1370 1375 1380 Glu Pro Lys Pro Asp Asp Gly Arg Lys Ser Phe Leu Met Lys Arg 1385 1390 1395 Gly Asp Val Ile Asp Tyr Ser Ser Ser Gly Val Ser Thr Asn Asp 1400 1405 1410 Ala Ser Pro Leu Asp Pro Ile Thr Glu Glu Asp Glu Lys Ser Asp 1415 1420 1425 Gln Ser Gly Ser Lys Leu Leu Pro Gly Lys Lys Ser Ser Glu Arg 1430 1435 1440 Ser Ser Leu Phe Gln Thr Asp Leu Lys Leu Lys Gly Ser Gly Leu 1445 1450 1455 Arg Tyr Gln Lys Leu Pro Ser Asp Glu Asp Glu Ser Gly Thr Glu 1460 1465 1470 Glu Ser Asp Asn Thr Pro Leu Leu Lys Asp Asp Lys Asp Arg Lys 1475 1480 1485 Ala Glu Gly Lys Val Glu Arg Val Pro Lys Ser Pro Glu His Ser 1490 1495 1500 Ala Glu Pro Ile Arg Thr Phe Ile Lys Ala Lys Glu Tyr Leu Ser 1505 1510 1515 Asp Ala Leu Leu Asp Lys Lys Asp Ser Ser Asp Ser Gly Val Arg 1520 1525 1530 Ser Ser Glu Ser Ser Pro Asn His Ser Leu His Asn Glu Val Ala 1535 1540 1545 Asp Asp Ser Gln Leu Glu Lys Ala Asn Leu Ile Glu Leu Glu Asp 1550 1555 1560 Asp Ser His Ser Gly Lys Arg Gly Ile Pro His Ser Leu Ser Gly 1565 1570 1575 Leu Gln Asp Pro Ile Ile Ala Arg Met Ser Ile Cys Ser Glu Asp 1580 1585 1590 Lys Lys Ser Pro Ser Glu Cys Ser Leu Ile Ala Ser Ser Pro Glu 1595 1600 1605 Glu Asn Trp Pro Ala Cys Gln Lys Ala Tyr Asn Leu Asn Arg Thr 1610 1615 1620 Pro Ser Thr Val Thr Leu Asn Asn Asn Ser Ala Pro Ala Asn Arg 1625 1630 1635 Ala Asn Gln Asn Phe Asp Glu Met Glu Gly Ile Arg Glu Thr Ser 1640 1645 1650 Gln Val Ile Leu Arg Pro Ser Ser Ser Pro Asn Pro Thr Thr Ile 1655 1660 1665 Gln Asn Glu Asn Leu Lys Ser Met Thr His Lys Arg Ser Gln Arg 1670 1675 1680 Ser Ser Tyr Thr Arg Leu Ser Lys Asp Pro Pro Glu Leu His Ala 1685 1690 1695 Ala Ala Ser Ser Glu Ser Thr Gly Phe Gly Glu Glu Arg Glu Ser 1700 1705 1710 Ile Leu 1715 5 4197 DNA Caenorhabditis Elegans CDS (1)..(4197) 5 atg gtt caa aca ctg aga aga ccg tgg caa gaa gcg gca agc agt gct 48 Met Val Gln Thr Leu Arg Arg Pro Trp Gln Glu Ala Ala Ser Ser Ala 1 5 10 15 ttt gcc gtc gcc agt gct ctt ccg gtg act atg aat agt aca caa att 96 Phe Ala Val Ala Ser Ala Leu Pro Val Thr Met Asn Ser Thr Gln Ile 20 25 30 gct gag ctc ttc gaa cag gtt gag cat ggc acc aca gaa ctc aga tgt 144 Ala Glu Leu Phe Glu Gln Val Glu His Gly Thr Thr Glu Leu Arg Cys 35 40 45 gca ctc acc gca gaa ata tca gct ctc aga aac gct aat gga gaa tca 192 Ala Leu Thr Ala Glu Ile Ser Ala Leu Arg Asn Ala Asn Gly Glu Ser 50 55 60 ctt ctc aca gtt gcc gtt cga tca gga aac aca gca gtt gcc aaa caa 240 Leu Leu Thr Val Ala Val Arg Ser Gly Asn Thr Ala Val Ala Lys Gln 65 70 75 80 tta gca caa ttg gat cca gac gca att gat gaa act gac aat gaa ggt 288 Leu Ala Gln Leu Asp Pro Asp Ala Ile Asp Glu Thr Asp Asn Glu Gly 85 90 95 tgg agt gct ctt ctg aat gca gct cac tgt ggt cat gtt gac att gtt 336 Trp Ser Ala Leu Leu Asn Ala Ala His Cys Gly His Val Asp Ile Val 100 105 110 cga tta tta att gat aat gga gca tct gtt gat caa cct gat tta atg 384 Arg Leu Leu Ile Asp Asn Gly Ala Ser Val Asp Gln Pro Asp Leu Met 115 120 125 gga tgg agt cct ctt atg tgg gca gtt tat aaa aat cat tta gat gtg 432 Gly Trp Ser Pro Leu Met Trp Ala Val Tyr Lys Asn His Leu Asp Val 130 135 140 gtt gat cta ctt gtt aac gca aaa gta aat gcc tgt gac aaa ttc gga 480 Val Asp Leu Leu Val Asn Ala Lys Val Asn Ala Cys Asp Lys Phe Gly 145 150 155 160 agt act gca tta att tgg gca gct cga aaa gga cat ctt cct gtt gta 528 Ser Thr Ala Leu Ile Trp Ala Ala Arg Lys Gly His Leu Pro Val Val 165 170 175 caa cta ctt tta aat tct gga gct gag gtg gat gca gtt gga atg tac 576 Gln Leu Leu Leu Asn Ser Gly Ala Glu Val Asp Ala Val Gly Met Tyr 180 185 190 tca tcc aca gct tta atg ttg gca aca cgt gga aac ttt att caa gta 624 Ser Ser Thr Ala Leu Met Leu Ala Thr Arg Gly Asn Phe Ile Gln Val 195 200 205 gtt gag ctt ctt ctg aca cga gaa cca aat gtt aat gta gct gat caa 672 Val Glu Leu Leu Leu Thr Arg Glu Pro Asn Val Asn Val Ala Asp Gln 210 215 220 aat ggt ttg aca gct ctc gga atg gca gct cgt gat gga tat gct gat 720 Asn Gly Leu Thr Ala Leu Gly Met Ala Ala Arg Asp Gly Tyr Ala Asp 225 230 235 240 ata tgt gaa tct ctt att aat tcg ggt gct ttt gtg aat cag tgt gat 768 Ile Cys Glu Ser Leu Ile Asn Ser Gly Ala Phe Val Asn Gln Cys Asp 245 250 255 aga ttt gga aac tgg att ctt acg tca gca gtt cga agt gga aat gcg 816 Arg Phe Gly Asn Trp Ile Leu Thr Ser Ala Val Arg Ser Gly Asn Ala 260 265 270 gca att gtt aga atg att ctc gac aaa ttt gct gat atc aat tgt caa 864 Ala Ile Val Arg Met Ile Leu Asp Lys Phe Ala Asp Ile Asn Cys Gln 275 280 285 gac tct gaa aaa cgg aca cca ctt cat ctg gcc atc gat aaa tca ttc 912 Asp Ser Glu Lys Arg Thr Pro Leu His Leu Ala Ile Asp Lys Ser Phe 290 295 300 aat gat atc gcc tac att tta ttg gag aaa aaa ccg aat ttg gaa ctg 960 Asn Asp Ile Ala Tyr Ile Leu Leu Glu Lys Lys Pro Asn Leu Glu Leu 305 310 315 320 aaa aat aag gac ggt gaa act cca ctt ctt cgt gct gct aaa tgt cgt 1008 Lys Asn Lys Asp Gly Glu Thr Pro Leu Leu Arg Ala Ala Lys Cys Arg 325 330 335 cat gta cat ctg tgt acc tac ttg atg agt ttc gga gct aaa ctc gct 1056 His Val His Leu Cys Thr Tyr Leu Met Ser Phe Gly Ala Lys Leu Ala 340 345 350 gca gtt gat aat tgt gga gat aat gca ctt cat ttg gca ctg aga gca 1104 Ala Val Asp Asn Cys Gly Asp Asn Ala Leu His Leu Ala Leu Arg Ala 355 360 365 cgt agt cga cga tta aca cag gca ctt ttg tcc aat cca tca gat tct 1152 Arg Ser Arg Arg Leu Thr Gln Ala Leu Leu Ser Asn Pro Ser Asp Ser 370 375 380 cgg tta ctg tac cga ccc aat aag ctt ggt cag aca ccg tat tcc att 1200 Arg Leu Leu Tyr Arg Pro Asn Lys Leu Gly Gln Thr Pro Tyr Ser Ile 385 390 395 400 gac tta tca aat cct caa cct att ctg cca cta att ttt gga cca att 1248 Asp Leu Ser Asn Pro Gln Pro Ile Leu Pro Leu Ile Phe Gly Pro Ile 405 410 415 gat gcg gaa gat aaa atg gat aca gct atg gga tat gat gta tat tct 1296 Asp Ala Glu Asp Lys Met Asp Thr Ala Met Gly Tyr Asp Val Tyr Ser 420 425 430 aat gtt tta gct gat att gtg tgt gaa cct tct ctt tct ctt cca ctt 1344 Asn Val Leu Ala Asp Ile Val Cys Glu Pro Ser Leu Ser Leu Pro Leu 435 440 445 aca att gga ctt tac gcc aaa tgg gga tcc gga aaa tca gca ctt ctc 1392 Thr Ile Gly Leu Tyr Ala Lys Trp Gly Ser Gly Lys Ser Ala Leu Leu 450 455 460 gca aaa ttg aaa gag gca atg cac agt ttc tca agg gat tgg ctt gat 1440 Ala Lys Leu Lys Glu Ala Met His Ser Phe Ser Arg Asp Trp Leu Asp 465 470 475 480 gga gtt tcg ctt tca gtt tcg ttt gcc tta ttc ttt gcc ata ttc ctc 1488 Gly Val Ser Leu Ser Val Ser Phe Ala Leu Phe Phe Ala Ile Phe Leu 485 490 495 ttt ttt gga atg ttc tca cta act ttt aca atg ctc att gcc att tct 1536 Phe Phe Gly Met Phe Ser Leu Thr Phe Thr Met Leu Ile Ala Ile Ser 500 505 510 aac tca gtc aca gca tat ctc atc tca tgg tcc gtc ttt ctt ctc att 1584 Asn Ser Val Thr Ala Tyr Leu Ile Ser Trp Ser Val Phe Leu Leu Ile 515 520 525 ttc atc ata ttc tgt tca ctg att gtt gtg gtt tat tat ggg gat cgc 1632 Phe Ile Ile Phe Cys Ser Leu Ile Val Val Val Tyr Tyr Gly Asp Arg 530 535 540 aag aac tgg tat act tca atg gat ata gcc aat ttt ttt gca cgg gtg 1680 Lys Asn Trp Tyr Thr Ser Met Asp Ile Ala Asn Phe Phe Ala Arg Val 545 550 555 560 ttc agt cgg atc cga ttg gtt tac aac att tta aca ctt cat gca ccg 1728 Phe Ser Arg Ile Arg Leu Val Tyr Asn Ile Leu Thr Leu His Ala Pro 565 570 575 atg aac agt gaa gat agt gca agt atg cct gtt agt ttt ctc ttt gcc 1776 Met Asn Ser Glu Asp Ser Ala Ser Met Pro Val Ser Phe Leu Phe Ala 580 585 590 gac tac cac cgt ctt tct tct att gga ggc gag caa gct ctt gct aaa 1824 Asp Tyr His Arg Leu Ser Ser Ile Gly Gly Glu Gln Ala Leu Ala Lys 595 600 605 atc gtg gcg aca ttg ttc gag gca gct gaa act cat ttt gga gtg ctt 1872 Ile Val Ala Thr Leu Phe Glu Ala Ala Glu Thr His Phe Gly Val Leu 610 615 620 cca gtt cga ctt ttc tgt tgc atg aaa cca cca tac cct gga att cat 1920 Pro Val Arg Leu Phe Cys Cys Met Lys Pro Pro Tyr Pro Gly Ile His 625 630 635 640 gga tct tta cgt cgt cac tgc ggt gtt ccc cat gtt att ctt ctg atc 1968 Gly Ser Leu Arg Arg His Cys Gly Val Pro His Val Ile Leu Leu Ile 645 650 655 gtt gcc gtc ttt cta cta atc atg gct caa gtt ttt gga aca gtt tgg 2016 Val Ala Val Phe Leu Leu Ile Met Ala Gln Val Phe Gly Thr Val Trp 660 665 670 cta ctt tct gat cga gat ccg aat aac ttc aat cta ttc ata gca att 2064 Leu Leu Ser Asp Arg Asp Pro Asn Asn Phe Asn Leu Phe Ile Ala Ile 675 680 685 gca ttc ctt tgt gga ttt gta atg att gcc atc tat cca tta gca tta 2112 Ala Phe Leu Cys Gly Phe Val Met Ile Ala Ile Tyr Pro Leu Ala Leu 690 695 700 att ata atg tat tca tgg aca aat gta cca aga cga aga gtc aat gca 2160 Ile Ile Met Tyr Ser Trp Thr Asn Val Pro Arg Arg Arg Val Asn Ala 705 710 715 720 gct gct cgg aat gct cat aaa tta cga ttt gaa ggt cta atg caa aag 2208 Ala Ala Arg Asn Ala His Lys Leu Arg Phe Glu Gly Leu Met Gln Lys 725 730 735 tta cag acc gag gtt gat ctt ctt gct gat atg atc cga tca ctc gat 2256 Leu Gln Thr Glu Val Asp Leu Leu Ala Asp Met Ile Arg Ser Leu Asp 740 745 750 gct ttc act aga agt cac aca cgt ctc gtt gtg gtg gtc gat ggg ctc 2304 Ala Phe Thr Arg Ser His Thr Arg Leu Val Val Val Val Asp Gly Leu 755 760 765 gat aat tgt gag cag gaa cga atg gtt cag act ctt gat gct ctt gag 2352 Asp Asn Cys Glu Gln Glu Arg Met Val Gln Thr Leu Asp Ala Leu Glu 770 775 780 cta tta ttc tcg gcc cgc aag cat cgc ccc ttc atc aca att att gct 2400 Leu Leu Phe Ser Ala Arg Lys His Arg Pro Phe Ile Thr Ile Ile Ala 785 790 795 800 gtg gat cca cat gta ata gta tct gca atc aat cat aat atg cat tca 2448 Val Asp Pro His Val Ile Val Ser Ala Ile Asn His Asn Met His Ser 805 810 815 gca ctg tct ggc act gaa ctc act gga cac gac tat ctg aaa aac atc 2496 Ala Leu Ser Gly Thr Glu Leu Thr Gly His Asp Tyr Leu Lys Asn Ile 820 825 830 atc agt atg ccg ttc tat ctt cat aat tct gct ctt cgt cag cta caa 2544 Ile Ser Met Pro Phe Tyr Leu His Asn Ser Ala Leu Arg Gln Leu Gln 835 840 845 tct aaa ctt cgt gaa aaa cga gaa tcg atg gca gaa tgg aaa gaa agg 2592 Ser Lys Leu Arg Glu Lys Arg Glu Ser Met Ala Glu Trp Lys Glu Arg 850 855 860 ttc aaa cgt caa gac aca ttc tat gga tca cat ctt tca ttg aga gaa 2640 Phe Lys Arg Gln Asp Thr Phe Tyr Gly Ser His Leu Ser Leu Arg Glu 865 870 875 880 gct gat gga aga aca agc aga aag aag tcc act gtt cca aac atg aac 2688 Ala Asp Gly Arg Thr Ser Arg Lys Lys Ser Thr Val Pro Asn Met Asn 885 890 895 tcg aat agt gtg gtt ggt cgg aat atg aat gac gga att ctt ggc gaa 2736 Ser Asn Ser Val Val Gly Arg Asn Met Asn Asp Gly Ile Leu Gly Glu 900 905 910 gac tac ttt tcc aat atg aat cct cgt gcc atg cgt cgt att gtc aat 2784 Asp Tyr Phe Ser Asn Met Asn Pro Arg Ala Met Arg Arg Ile Val Asn 915 920 925 gct tta act cta act gga cgt ctc atg aga gct ttt gaa att gac ttt 2832 Ala Leu Thr Leu Thr Gly Arg Leu Met Arg Ala Phe Glu Ile Asp Phe 930 935 940 tca tgg atg tca ctc gga cat tgg gta tct ctt ctc gaa caa tgg cct 2880 Ser Trp Met Ser Leu Gly His Trp Val Ser Leu Leu Glu Gln Trp Pro 945 950 955 960 tca cga atg tgt tgg ctt att gat aga gct ctt gaa gtt cac aac aat 2928 Ser Arg Met Cys Trp Leu Ile Asp Arg Ala Leu Glu Val His Asn Asn 965 970 975 cag ctt ctt cta tct gaa gta tac tat caa ctc aag gat cac att cca 2976 Gln Leu Leu Leu Ser Glu Val Tyr Tyr Gln Leu Lys Asp His Ile Pro 980 985 990 gct cag gat gat ttg atg cag ctc gac aga aat ttg gag aat ttt gaa 3024 Ala Gln Asp Asp Leu Met Gln Leu Asp Arg Asn Leu Glu Asn Phe Glu 995 1000 1005 ggt ttc cta gat tct aaa gga atc cca tca gca gag aga ctt act 3069 Gly Phe Leu Asp Ser Lys Gly Ile Pro Ser Ala Glu Arg Leu Thr 1010 1015 1020 gtt gga cat gtc aag aaa ttt gtc ccc tgt acc agc agt ttg gat 3114 Val Gly His Val Lys Lys Phe Val Pro Cys Thr Ser Ser Leu Asp 1025 1030 1035 cct tat ttg aga aag ttg att cgc gag aga agt aaa gga tta gtg 3159 Pro Tyr Leu Arg Lys Leu Ile Arg Glu Arg Ser Lys Gly Leu Val 1040 1045 1050 gat att gaa gca caa gtt gga agt gca gga atg gct att cca ccg 3204 Asp Ile Glu Ala Gln Val Gly Ser Ala Gly Met Ala Ile Pro Pro 1055 1060 1065 aat gcg aga ctt ttg ttt agt gat gat tta aca tgg atg tcg att 3249 Asn Ala Arg Leu Leu Phe Ser Asp Asp Leu Thr Trp Met Ser Ile 1070 1075 1080 gat act cca ctt gtc gaa atg aag ctt gat gca gtt gtc aac ctg 3294 Asp Thr Pro Leu Val Glu Met Lys Leu Asp Ala Val Val Asn Leu 1085 1090 1095 atc cgt aaa ata gac att cca tca aat cgc ctc gat tca att ctc 3339 Ile Arg Lys Ile Asp Ile Pro Ser Asn Arg Leu Asp Ser Ile Leu 1100 1105 1110 gat aga ttc tat caa tta aac tta tgt gga ttg gtt ctt gcc acg 3384 Asp Arg Phe Tyr Gln Leu Asn Leu Cys Gly Leu Val Leu Ala Thr 1115 1120 1125 tgt cca ttg cca gaa ctg aag gat agt atg caa tta cca ctt ggt 3429 Cys Pro Leu Pro Glu Leu Lys Asp Ser Met Gln Leu Pro Leu Gly 1130 1135 1140 gat tgg act ttg att cgt ttg ctt ctt gaa act ctt aaa gta ttt 3474 Asp Trp Thr Leu Ile Arg Leu Leu Leu Glu Thr Leu Lys Val Phe 1145 1150 1155 ggc tcc tct cca cca gga ctc cga gtg gat aaa cga aaa gca ctg 3519 Gly Ser Ser Pro Pro Gly Leu Arg Val Asp Lys Arg Lys Ala Leu 1160 1165 1170 aca ctc cgc gaa gag gat gag gaa gag gaa att gaa gag gca gca 3564 Thr Leu Arg Glu Glu Asp Glu Glu Glu Glu Ile Glu Glu Ala Ala 1175 1180 1185 gaa gct gct ttg aac tct gaa aaa gag cgt gct cca ctt ctt gga 3609 Glu Ala Ala Leu Asn Ser Glu Lys Glu Arg Ala Pro Leu Leu Gly 1190 1195 1200 tct gtg aga gct gaa cag aga aga cgg tcc aca att gtt cag aat 3654 Ser Val Arg Ala Glu Gln Arg Arg Arg Ser Thr Ile Val Gln Asn 1205 1210 1215 gcc act gag ctg agc att gat cac aaa tgt ctt atg gag aag tta 3699 Ala Thr Glu Leu Ser Ile Asp His Lys Cys Leu Met Glu Lys Leu 1220 1225 1230 tcg gga atg gat ttg act gaa acc gaa ggt gat gtc aat gaa atg 3744 Ser Gly Met Asp Leu Thr Glu Thr Glu Gly Asp Val Asn Glu Met 1235 1240 1245 cat ttc tct cat ttt tcg tcg tca act gat gga cca tct cca atg 3789 His Phe Ser His Phe Ser Ser Ser Thr Asp Gly Pro Ser Pro Met 1250 1255 1260 gct gat gga ttc ctt cca gct tca gtt tct gct gct cca tca gtc 3834 Ala Asp Gly Phe Leu Pro Ala Ser Val Ser Ala Ala Pro Ser Val 1265 1270 1275 aga ttc gac gat aat att aat gat ctg gaa cgt gaa gct tct gac 3879 Arg Phe Asp Asp Asn Ile Asn Asp Leu Glu Arg Glu Ala Ser Asp 1280 1285 1290 gct gac agt aca caa tcc cga tac gat tcg aaa gaa aat ctt ctt 3924 Ala Asp Ser Thr Gln Ser Arg Tyr Asp Ser Lys Glu Asn Leu Leu 1295 1300 1305 gaa gat gaa aga agt gct tct cca ccg gca cat gtt gat tta atg 3969 Glu Asp Glu Arg Ser Ala Ser Pro Pro Ala His Val Asp Leu Met 1310 1315 1320 aga ttt gac agt gga aat aca agc agt gat ttc gta cgg tcc agt 4014 Arg Phe Asp Ser Gly Asn Thr Ser Ser Asp Phe Val Arg Ser Ser 1325 1330 1335 tac cgt acc tca tcg aga tat cgt aga atg agc cga gga gac ttg 4059 Tyr Arg Thr Ser Ser Arg Tyr Arg Arg Met Ser Arg Gly Asp Leu 1340 1345 1350 atg cga gga atg cac cag aat gac aca atc cga agc aaa ata aat 4104 Met Arg Gly Met His Gln Asn Asp Thr Ile Arg Ser Lys Ile Asn 1355 1360 1365 ttt cag aac cag caa act gta caa cga cgt ggt gga gac gac ttt 4149 Phe Gln Asn Gln Gln Thr Val Gln Arg Arg Gly Gly Asp Asp Phe 1370 1375 1380 gag cgt atc caa ctt gaa caa cta ttc aat cga caa tcc gat gct 4194 Glu Arg Ile Gln Leu Glu Gln Leu Phe Asn Arg Gln Ser Asp Ala 1385 1390 1395 taa 4197 6 1398 PRT Caenorhabditis Elegans 6 Met Val Gln Thr Leu Arg Arg Pro Trp Gln Glu Ala Ala Ser Ser Ala 1 5 10 15 Phe Ala Val Ala Ser Ala Leu Pro Val Thr Met Asn Ser Thr Gln Ile 20 25 30 Ala Glu Leu Phe Glu Gln Val Glu His Gly Thr Thr Glu Leu Arg Cys 35 40 45 Ala Leu Thr Ala Glu Ile Ser Ala Leu Arg Asn Ala Asn Gly Glu Ser 50 55 60 Leu Leu Thr Val Ala Val Arg Ser Gly Asn Thr Ala Val Ala Lys Gln 65 70 75 80 Leu Ala Gln Leu Asp Pro Asp Ala Ile Asp Glu Thr Asp Asn Glu Gly 85 90 95 Trp Ser Ala Leu Leu Asn Ala Ala His Cys Gly His Val Asp Ile Val 100 105 110 Arg Leu Leu Ile Asp Asn Gly Ala Ser Val Asp Gln Pro Asp Leu Met 115 120 125 Gly Trp Ser Pro Leu Met Trp Ala Val Tyr Lys Asn His Leu Asp Val 130 135 140 Val Asp Leu Leu Val Asn Ala Lys Val Asn Ala Cys Asp Lys Phe Gly 145 150 155 160 Ser Thr Ala Leu Ile Trp Ala Ala Arg Lys Gly His Leu Pro Val Val 165 170 175 Gln Leu Leu Leu Asn Ser Gly Ala Glu Val Asp Ala Val Gly Met Tyr 180 185 190 Ser Ser Thr Ala Leu Met Leu Ala Thr Arg Gly Asn Phe Ile Gln Val 195 200 205 Val Glu Leu Leu Leu Thr Arg Glu Pro Asn Val Asn Val Ala Asp Gln 210 215 220 Asn Gly Leu Thr Ala Leu Gly Met Ala Ala Arg Asp Gly Tyr Ala Asp 225 230 235 240 Ile Cys Glu Ser Leu Ile Asn Ser Gly Ala Phe Val Asn Gln Cys Asp 245 250 255 Arg Phe Gly Asn Trp Ile Leu Thr Ser Ala Val Arg Ser Gly Asn Ala 260 265 270 Ala Ile Val Arg Met Ile Leu Asp Lys Phe Ala Asp Ile Asn Cys Gln 275 280 285 Asp Ser Glu Lys Arg Thr Pro Leu His Leu Ala Ile Asp Lys Ser Phe 290 295 300 Asn Asp Ile Ala Tyr Ile Leu Leu Glu Lys Lys Pro Asn Leu Glu Leu 305 310 315 320 Lys Asn Lys Asp Gly Glu Thr Pro Leu Leu Arg Ala Ala Lys Cys Arg 325 330 335 His Val His Leu Cys Thr Tyr Leu Met Ser Phe Gly Ala Lys Leu Ala 340 345 350 Ala Val Asp Asn Cys Gly Asp Asn Ala Leu His Leu Ala Leu Arg Ala 355 360 365 Arg Ser Arg Arg Leu Thr Gln Ala Leu Leu Ser Asn Pro Ser Asp Ser 370 375 380 Arg Leu Leu Tyr Arg Pro Asn Lys Leu Gly Gln Thr Pro Tyr Ser Ile 385 390 395 400 Asp Leu Ser Asn Pro Gln Pro Ile Leu Pro Leu Ile Phe Gly Pro Ile 405 410 415 Asp Ala Glu Asp Lys Met Asp Thr Ala Met Gly Tyr Asp Val Tyr Ser 420 425 430 Asn Val Leu Ala Asp Ile Val Cys Glu Pro Ser Leu Ser Leu Pro Leu 435 440 445 Thr Ile Gly Leu Tyr Ala Lys Trp Gly Ser Gly Lys Ser Ala Leu Leu 450 455 460 Ala Lys Leu Lys Glu Ala Met His Ser Phe Ser Arg Asp Trp Leu Asp 465 470 475 480 Gly Val Ser Leu Ser Val Ser Phe Ala Leu Phe Phe Ala Ile Phe Leu 485 490 495 Phe Phe Gly Met Phe Ser Leu Thr Phe Thr Met Leu Ile Ala Ile Ser 500 505 510 Asn Ser Val Thr Ala Tyr Leu Ile Ser Trp Ser Val Phe Leu Leu Ile 515 520 525 Phe Ile Ile Phe Cys Ser Leu Ile Val Val Val Tyr Tyr Gly Asp Arg 530 535 540 Lys Asn Trp Tyr Thr Ser Met Asp Ile Ala Asn Phe Phe Ala Arg Val 545 550 555 560 Phe Ser Arg Ile Arg Leu Val Tyr Asn Ile Leu Thr Leu His Ala Pro 565 570 575 Met Asn Ser Glu Asp Ser Ala Ser Met Pro Val Ser Phe Leu Phe Ala 580 585 590 Asp Tyr His Arg Leu Ser Ser Ile Gly Gly Glu Gln Ala Leu Ala Lys 595 600 605 Ile Val Ala Thr Leu Phe Glu Ala Ala Glu Thr His Phe Gly Val Leu 610 615 620 Pro Val Arg Leu Phe Cys Cys Met Lys Pro Pro Tyr Pro Gly Ile His 625 630 635 640 Gly Ser Leu Arg Arg His Cys Gly Val Pro His Val Ile Leu Leu Ile 645 650 655 Val Ala Val Phe Leu Leu Ile Met Ala Gln Val Phe Gly Thr Val Trp 660 665 670 Leu Leu Ser Asp Arg Asp Pro Asn Asn Phe Asn Leu Phe Ile Ala Ile 675 680 685 Ala Phe Leu Cys Gly Phe Val Met Ile Ala Ile Tyr Pro Leu Ala Leu 690 695 700 Ile Ile Met Tyr Ser Trp Thr Asn Val Pro Arg Arg Arg Val Asn Ala 705 710 715 720 Ala Ala Arg Asn Ala His Lys Leu Arg Phe Glu Gly Leu Met Gln Lys 725 730 735 Leu Gln Thr Glu Val Asp Leu Leu Ala Asp Met Ile Arg Ser Leu Asp 740 745 750 Ala Phe Thr Arg Ser His Thr Arg Leu Val Val Val Val Asp Gly Leu 755 760 765 Asp Asn Cys Glu Gln Glu Arg Met Val Gln Thr Leu Asp Ala Leu Glu 770 775 780 Leu Leu Phe Ser Ala Arg Lys His Arg Pro Phe Ile Thr Ile Ile Ala 785 790 795 800 Val Asp Pro His Val Ile Val Ser Ala Ile Asn His Asn Met His Ser 805 810 815 Ala Leu Ser Gly Thr Glu Leu Thr Gly His Asp Tyr Leu Lys Asn Ile 820 825 830 Ile Ser Met Pro Phe Tyr Leu His Asn Ser Ala Leu Arg Gln Leu Gln 835 840 845 Ser Lys Leu Arg Glu Lys Arg Glu Ser Met Ala Glu Trp Lys Glu Arg 850 855 860 Phe Lys Arg Gln Asp Thr Phe Tyr Gly Ser His Leu Ser Leu Arg Glu 865 870 875 880 Ala Asp Gly Arg Thr Ser Arg Lys Lys Ser Thr Val Pro Asn Met Asn 885 890 895 Ser Asn Ser Val Val Gly Arg Asn Met Asn Asp Gly Ile Leu Gly Glu 900 905 910 Asp Tyr Phe Ser Asn Met Asn Pro Arg Ala Met Arg Arg Ile Val Asn 915 920 925 Ala Leu Thr Leu Thr Gly Arg Leu Met Arg Ala Phe Glu Ile Asp Phe 930 935 940 Ser Trp Met Ser Leu Gly His Trp Val Ser Leu Leu Glu Gln Trp Pro 945 950 955 960 Ser Arg Met Cys Trp Leu Ile Asp Arg Ala Leu Glu Val His Asn Asn 965 970 975 Gln Leu Leu Leu Ser Glu Val Tyr Tyr Gln Leu Lys Asp His Ile Pro 980 985 990 Ala Gln Asp Asp Leu Met Gln Leu Asp Arg Asn Leu Glu Asn Phe Glu 995 1000 1005 Gly Phe Leu Asp Ser Lys Gly Ile Pro Ser Ala Glu Arg Leu Thr 1010 1015 1020 Val Gly His Val Lys Lys Phe Val Pro Cys Thr Ser Ser Leu Asp 1025 1030 1035 Pro Tyr Leu Arg Lys Leu Ile Arg Glu Arg Ser Lys Gly Leu Val 1040 1045 1050 Asp Ile Glu Ala Gln Val Gly Ser Ala Gly Met Ala Ile Pro Pro 1055 1060 1065 Asn Ala Arg Leu Leu Phe Ser Asp Asp Leu Thr Trp Met Ser Ile 1070 1075 1080 Asp Thr Pro Leu Val Glu Met Lys Leu Asp Ala Val Val Asn Leu 1085 1090 1095 Ile Arg Lys Ile Asp Ile Pro Ser Asn Arg Leu Asp Ser Ile Leu 1100 1105 1110 Asp Arg Phe Tyr Gln Leu Asn Leu Cys Gly Leu Val Leu Ala Thr 1115 1120 1125 Cys Pro Leu Pro Glu Leu Lys Asp Ser Met Gln Leu Pro Leu Gly 1130 1135 1140 Asp Trp Thr Leu Ile Arg Leu Leu Leu Glu Thr Leu Lys Val Phe 1145 1150 1155 Gly Ser Ser Pro Pro Gly Leu Arg Val Asp Lys Arg Lys Ala Leu 1160 1165 1170 Thr Leu Arg Glu Glu Asp Glu Glu Glu Glu Ile Glu Glu Ala Ala 1175 1180 1185 Glu Ala Ala Leu Asn Ser Glu Lys Glu Arg Ala Pro Leu Leu Gly 1190 1195 1200 Ser Val Arg Ala Glu Gln Arg Arg Arg Ser Thr Ile Val Gln Asn 1205 1210 1215 Ala Thr Glu Leu Ser Ile Asp His Lys Cys Leu Met Glu Lys Leu 1220 1225 1230 Ser Gly Met Asp Leu Thr Glu Thr Glu Gly Asp Val Asn Glu Met 1235 1240 1245 His Phe Ser His Phe Ser Ser Ser Thr Asp Gly Pro Ser Pro Met 1250 1255 1260 Ala Asp Gly Phe Leu Pro Ala Ser Val Ser Ala Ala Pro Ser Val 1265 1270 1275 Arg Phe Asp Asp Asn Ile Asn Asp Leu Glu Arg Glu Ala Ser Asp 1280 1285 1290 Ala Asp Ser Thr Gln Ser Arg Tyr Asp Ser Lys Glu Asn Leu Leu 1295 1300 1305 Glu Asp Glu Arg Ser Ala Ser Pro Pro Ala His Val Asp Leu Met 1310 1315 1320 Arg Phe Asp Ser Gly Asn Thr Ser Ser Asp Phe Val Arg Ser Ser 1325 1330 1335 Tyr Arg Thr Ser Ser Arg Tyr Arg Arg Met Ser Arg Gly Asp Leu 1340 1345 1350 Met Arg Gly Met His Gln Asn Asp Thr Ile Arg Ser Lys Ile Asn 1355 1360 1365 Phe Gln Asn Gln Gln Thr Val Gln Arg Arg Gly Gly Asp Asp Phe 1370 1375 1380 Glu Arg Ile Gln Leu Glu Gln Leu Phe Asn Arg Gln Ser Asp Ala 1385 1390 1395 7 4497 DNA Drosophila CDS (1)..(4497) 7 atg ttc aaa tcc cgc cta aag acg aat ccg ggt ggg agt gag aac ata 48 Met Phe Lys Ser Arg Leu Lys Thr Asn Pro Gly Gly Ser Glu Asn Ile 1 5 10 15 aat cgc ttc gga gac tcc atg ggt tcg ctg gga cat cgt gcc ttg ttg 96 Asn Arg Phe Gly Asp Ser Met Gly Ser Leu Gly His Arg Ala Leu Leu 20 25 30 cag tac atc gat aat aat gat ata tcc ggt ctg cgc gcc att ttg gac 144 Gln Tyr Ile Asp Asn Asn Asp Ile Ser Gly Leu Arg Ala Ile Leu Asp 35 40 45 agc cga cat ctt acc atc gat gat cgg gac gag aat gcc acc acg gtg 192 Ser Arg His Leu Thr Ile Asp Asp Arg Asp Glu Asn Ala Thr Thr Val 50 55 60 ttg atg gta gtt gct gga cga ggc tta act gcc ttt gtt cgc gag ttc 240 Leu Met Val Val Ala Gly Arg Gly Leu Thr Ala Phe Val Arg Glu Phe 65 70 75 80 ctg gcc cgc ggt gcc gat gtt caa gcg gag gat ctg gac aac tgg acg 288 Leu Ala Arg Gly Ala Asp Val Gln Ala Glu Asp Leu Asp Asn Trp Thr 85 90 95 gca ctg ctg tgt gcc tcc cga aac ggg cac tta gat gtt gtc caa ctg 336 Ala Leu Leu Cys Ala Ser Arg Asn Gly His Leu Asp Val Val Gln Leu 100 105 110 ctg ctg gac cat ggc gct gaa gtg gag cat cgc gat atg ggt ggc tgg 384 Leu Leu Asp His Gly Ala Glu Val Glu His Arg Asp Met Gly Gly Trp 115 120 125 acc agt ttg atg tgg gca gct tac cgt gga cac acg gag ctg gtg cga 432 Thr Ser Leu Met Trp Ala Ala Tyr Arg Gly His Thr Glu Leu Val Arg 130 135 140 ctg ctg ctg gac aaa gga gcc gat ggc aat gcc cat ggt aac tac cac 480 Leu Leu Leu Asp Lys Gly Ala Asp Gly Asn Ala His Gly Asn Tyr His 145 150 155 160 ctg ggc gcc ctt ttg tgg gcg gcg ggc cgt ggc tac aag gac atc gtg 528 Leu Gly Ala Leu Leu Trp Ala Ala Gly Arg Gly Tyr Lys Asp Ile Val 165 170 175 gag ctg ctg gtg caa cgc ggc gct aaa gtg aat gtg ggt gat aag tac 576 Glu Leu Leu Val Gln Arg Gly Ala Lys Val Asn Val Gly Asp Lys Tyr 180 185 190 gga acc aca gcc ctg gtg tgg gca tgc cgg cgc gga aac gtt gag att 624 Gly Thr Thr Ala Leu Val Trp Ala Cys Arg Arg Gly Asn Val Glu Ile 195 200 205 gtg gat acc tta ctt aag gct gga gcc aat gtg gac acc gct ggc atg 672 Val Asp Thr Leu Leu Lys Ala Gly Ala Asn Val Asp Thr Ala Gly Met 210 215 220 tat tcg tgg acg cca ctg ctc gtt gcg gcc gcc ggt ggc cac aca gat 720 Tyr Ser Trp Thr Pro Leu Leu Val Ala Ala Ala Gly Gly His Thr Asp 225 230 235 240 tgc gtt agt tcc att ctc gag aag aaa cca aat gta aat gcc ttg gac 768 Cys Val Ser Ser Ile Leu Glu Lys Lys Pro Asn Val Asn Ala Leu Asp 245 250 255 aag gat gga atg act gcc ctt tgc ata gcg agt cga gag ggt ttc cag 816 Lys Asp Gly Met Thr Ala Leu Cys Ile Ala Ser Arg Glu Gly Phe Gln 260 265 270 gat att gcc gcc tcc ctt att gcg gct ggt gcc tac ata aat atc cag 864 Asp Ile Ala Ala Ser Leu Ile Ala Ala Gly Ala Tyr Ile Asn Ile Gln 275 280 285 gat cgc gga gca gac acg ccg ctc atc cat gcc gtt aaa gca gga cat 912 Asp Arg Gly Ala Asp Thr Pro Leu Ile His Ala Val Lys Ala Gly His 290 295 300 cga act gta gtg gag gct ttg ctc aaa aaa cat gcc gat gtg gat ata 960 Arg Thr Val Val Glu Ala Leu Leu Lys Lys His Ala Asp Val Asp Ile 305 310 315 320 cag gga aag gat cgc aag acg gcc att tac acc gcg gtg gag aaa gga 1008 Gln Gly Lys Asp Arg Lys Thr Ala Ile Tyr Thr Ala Val Glu Lys Gly 325 330 335 cat acg ccg att gtt aaa ctc ctg ttg gcc acc aat ccc gac ctc gaa 1056 His Thr Pro Ile Val Lys Leu Leu Leu Ala Thr Asn Pro Asp Leu Glu 340 345 350 tcc gcc acc aag gat ggc gac act ccg ctg ctg aga gct gtt aga aac 1104 Ser Ala Thr Lys Asp Gly Asp Thr Pro Leu Leu Arg Ala Val Arg Asn 355 360 365 cgc aac ttg gag att gtg cac tta ttg cta gat cga aag gcc aag gtg 1152 Arg Asn Leu Glu Ile Val His Leu Leu Leu Asp Arg Lys Ala Lys Val 370 375 380 acc gct agc gac aaa agg ggc gac acc tgc ctg cac atc gca atg cgg 1200 Thr Ala Ser Asp Lys Arg Gly Asp Thr Cys Leu His Ile Ala Met Arg 385 390 395 400 gcg agg agc aag acg att gtg gag gcc ctt ctg cga aat ccc aag cac 1248 Ala Arg Ser Lys Thr Ile Val Glu Ala Leu Leu Arg Asn Pro Lys His 405 410 415 agt cag ctt cta tat aga gcc aac aaa gct gga gaa acg ccc tac aac 1296 Ser Gln Leu Leu Tyr Arg Ala Asn Lys Ala Gly Glu Thr Pro Tyr Asn 420 425 430 att gac tcc ctg cac cag aag acc ata ttg ggt caa gtg ttt ggc gcc 1344 Ile Asp Ser Leu His Gln Lys Thr Ile Leu Gly Gln Val Phe Gly Ala 435 440 445 aga cgt ctg aac acc aac gag gac tcc gag gga atg ctg ggc tac gaa 1392 Arg Arg Leu Asn Thr Asn Glu Asp Ser Glu Gly Met Leu Gly Tyr Glu 450 455 460 ctg tac tcc tct gcc ttg gcg gat gtg ctc agt gag ccg aca tta acc 1440 Leu Tyr Ser Ser Ala Leu Ala Asp Val Leu Ser Glu Pro Thr Leu Thr 465 470 475 480 act ccc att acc gtt gga ctc tac gcc aaa tgg ggc agt gga aag agt 1488 Thr Pro Ile Thr Val Gly Leu Tyr Ala Lys Trp Gly Ser Gly Lys Ser 485 490 495 ttc ctg cta aat aag ctg cgc gac gag atg aac aac ttt gcg cgc caa 1536 Phe Leu Leu Asn Lys Leu Arg Asp Glu Met Asn Asn Phe Ala Arg Gln 500 505 510 tgg gcg gaa cca ccg att cga acc agt ggt ctg ctc ttc atc gtc tgt 1584 Trp Ala Glu Pro Pro Ile Arg Thr Ser Gly Leu Leu Phe Ile Val Cys 515 520 525 ctt cat gtg gca ttg ctc att gga act att gtg gga ctg agc act tgg 1632 Leu His Val Ala Leu Leu Ile Gly Thr Ile Val Gly Leu Ser Thr Trp 530 535 540 tca gcg gtt gtg ggc gtg tcg gcg gcc gtc ggc ttc ttg ctg ctc gcc 1680 Ser Ala Val Val Gly Val Ser Ala Ala Val Gly Phe Leu Leu Leu Ala 545 550 555 560 tat ctg ctc ctg gca gct gtt aga tac tgt aat tac caa atg gac atg 1728 Tyr Leu Leu Leu Ala Ala Val Arg Tyr Cys Asn Tyr Gln Met Asp Met 565 570 575 cag tgg gcc tac tct gtg caa cac ggt ctg gag aag aga atg aca aga 1776 Gln Trp Ala Tyr Ser Val Gln His Gly Leu Glu Lys Arg Met Thr Arg 580 585 590 ttg cgt ttg ata ctt cag gtg gcc ttt tgc cat ccc ccg gga ccc caa 1824 Leu Arg Leu Ile Leu Gln Val Ala Phe Cys His Pro Pro Gly Pro Gln 595 600 605 tcg gat tca cag gca aag ccc gtt cgg ttt cac ttt gcg gag gcc aac 1872 Ser Asp Ser Gln Ala Lys Pro Val Arg Phe His Phe Ala Glu Ala Asn 610 615 620 agc gct tcg ccc aca ggt gat gga gca gtg gct cat atg ctg gcc gct 1920 Ser Ala Ser Pro Thr Gly Asp Gly Ala Val Ala His Met Leu Ala Ala 625 630 635 640 ctc tta gat gct ata gaa tct cac tac ggt tgg ctg gcc acg cga ttg 1968 Leu Leu Asp Ala Ile Glu Ser His Tyr Gly Trp Leu Ala Thr Arg Leu 645 650 655 tac cga gcc ttc cgt cca aag tgc tta aaa gtg gat gtt ggc tgg cgt 2016 Tyr Arg Ala Phe Arg Pro Lys Cys Leu Lys Val Asp Val Gly Trp Arg 660 665 670 tgg cgt cgc atg tgc tgt atc cca atc gtt ttg atc ttc gaa ctt gct 2064 Trp Arg Arg Met Cys Cys Ile Pro Ile Val Leu Ile Phe Glu Leu Ala 675 680 685 ctc gtc acc gtg gtt act ggc atc tct ctc acc gtg gcc tac ttc acg 2112 Leu Val Thr Val Val Thr Gly Ile Ser Leu Thr Val Ala Tyr Phe Thr 690 695 700 ttt gcc gat gag aag gaa aag gaa cat ata tta gtg gct ctc tat gta 2160 Phe Ala Asp Glu Lys Glu Lys Glu His Ile Leu Val Ala Leu Tyr Val 705 710 715 720 att gct gcc gtg atg ggc acg ctg atc tgt acg cat ctc cat gtt ctg 2208 Ile Ala Ala Val Met Gly Thr Leu Ile Cys Thr His Leu His Val Leu 725 730 735 gcc aag gtg ttt gta tcc ctg ttc acc tcc cac att cga gta cta aaa 2256 Ala Lys Val Phe Val Ser Leu Phe Thr Ser His Ile Arg Val Leu Lys 740 745 750 agg gca gtt cgg tcc agt gag tcg gct cca ttg act atg ctg ggt gcc 2304 Arg Ala Val Arg Ser Ser Glu Ser Ala Pro Leu Thr Met Leu Gly Ala 755 760 765 gag gtg gcc gtg atg acg gac atg gtc aag tgc ctg gac gcg ttc acc 2352 Glu Val Ala Val Met Thr Asp Met Val Lys Cys Leu Asp Ala Phe Thr 770 775 780 aat cag cag agc cgc tta gtt ggc gtg atc gat gcc ttg gat tcc tgc 2400 Asn Gln Gln Ser Arg Leu Val Gly Val Ile Asp Ala Leu Asp Ser Cys 785 790 795 800 gac acg gag agg att ctc acc ctt cta aat gcg gtg caa acg ctt ctt 2448 Asp Thr Glu Arg Ile Leu Thr Leu Leu Asn Ala Val Gln Thr Leu Leu 805 810 815 tct tca cct aac cga cca ttt gtc ttg ctc att tct gtg gat ccc cat 2496 Ser Ser Pro Asn Arg Pro Phe Val Leu Leu Ile Ser Val Asp Pro His 820 825 830 gtc ata gcc aaa gca gcg gag gct aat agt cga cga ctc ttt acg gag 2544 Val Ile Ala Lys Ala Ala Glu Ala Asn Ser Arg Arg Leu Phe Thr Glu 835 840 845 ggc gga atc gga gga cat gac ttc ctg aga aac ttg gtg cat ctg cct 2592 Gly Gly Ile Gly Gly His Asp Phe Leu Arg Asn Leu Val His Leu Pro 850 855 860 gtt tat ctg caa aac tcc gga ctg aga aag gtg caa cga gcc cag atg 2640 Val Tyr Leu Gln Asn Ser Gly Leu Arg Lys Val Gln Arg Ala Gln Met 865 870 875 880 aca gcg ctg ctg ttc aag cga agt ggc gga gga gat tac cag aca gac 2688 Thr Ala Leu Leu Phe Lys Arg Ser Gly Gly Gly Asp Tyr Gln Thr Asp 885 890 895 gat gga ccc aca ttg ggt cac tct gta tcc gct cgt cga ctg tcc aat 2736 Asp Gly Pro Thr Leu Gly His Ser Val Ser Ala Arg Arg Leu Ser Asn 900 905 910 gcc tct gaa ata atc tcc agt cag gag aag ctg cgc gga ccc gcc cgt 2784 Ala Ser Glu Ile Ile Ser Ser Gln Glu Lys Leu Arg Gly Pro Ala Arg 915 920 925 gga ggt ggt gga aag aag ctg cgt ctc tcc gaa tcc gtg gcc agc tca 2832 Gly Gly Gly Gly Lys Lys Leu Arg Leu Ser Glu Ser Val Ala Ser Ser 930 935 940 act ggt tcc aat ctc cat cgc ctg ggt cag aat ccc cag acc gtg ctc 2880 Thr Gly Ser Asn Leu His Arg Leu Gly Gln Asn Pro Gln Thr Val Leu 945 950 955 960 gat cta tcg cgc att gtg ctc acg gat gat tac ttc agc gat gtg aat 2928 Asp Leu Ser Arg Ile Val Leu Thr Asp Asp Tyr Phe Ser Asp Val Asn 965 970 975 cca cga agt atg cgt cgc cta atg aac gtg atc tac atc acg gtg cgc 2976 Pro Arg Ser Met Arg Arg Leu Met Asn Val Ile Tyr Ile Thr Val Arg 980 985 990 ttg ctc aag gcc ttc cag att gag ttc agc tgg tat cgc ctg agt tcg 3024 Leu Leu Lys Ala Phe Gln Ile Glu Phe Ser Trp Tyr Arg Leu Ser Ser 995 1000 1005 tgg atc aat ctg act gag cag tgg cct ttg aga gca agt atg ata 3069 Trp Ile Asn Leu Thr Glu Gln Trp Pro Leu Arg Ala Ser Met Ile 1010 1015 1020 gtg ctg cat cac gat cag ttc atg gac agc aat gcg gat gag agt 3114 Val Leu His His Asp Gln Phe Met Asp Ser Asn Ala Asp Glu Ser 1025 1030 1035 gta tcg ctg caa agc gtt tac gag aaa ttg cga ccc aaa ctc gct 3159 Val Ser Leu Gln Ser Val Tyr Glu Lys Leu Arg Pro Lys Leu Ala 1040 1045 1050 tac ttg cga gaa gct gct cca ctt cta gag ttg gat agg gat gaa 3204 Tyr Leu Arg Glu Ala Ala Pro Leu Leu Glu Leu Asp Arg Asp Glu 1055 1060 1065 cga aag ctc gac gcc ttc ctg cag ctg cac aaa tca gat tta ctc 3249 Arg Lys Leu Asp Ala Phe Leu Gln Leu His Lys Ser Asp Leu Leu 1070 1075 1080 gtg gcg gat ctc cgc atc ttc ttg ccc ttc acc att aat ctg gat 3294 Val Ala Asp Leu Arg Ile Phe Leu Pro Phe Thr Ile Asn Leu Asp 1085 1090 1095 ccc tac tta aga aag gtc tta aag gag gat cag caa acc atc gag 3339 Pro Tyr Leu Arg Lys Val Leu Lys Glu Asp Gln Gln Thr Ile Glu 1100 1105 1110 gac gag ggc tcc cta gtg ata caa gca agg ccc agc gtt tcc aat 3384 Asp Glu Gly Ser Leu Val Ile Gln Ala Arg Pro Ser Val Ser Asn 1115 1120 1125 aca atg cgt caa ttc cca gcg ccc acc act tac gtg cct tcg ccg 3429 Thr Met Arg Gln Phe Pro Ala Pro Thr Thr Tyr Val Pro Ser Pro 1130 1135 1140 cag gct tat cca ccc tac caa atg ttc cag aac gag tat cct gcc 3474 Gln Ala Tyr Pro Pro Tyr Gln Met Phe Gln Asn Glu Tyr Pro Ala 1145 1150 1155 aat gag ttg cgc tcc agg aat ctc agc acg agc aca gag cct gtc 3519 Asn Glu Leu Arg Ser Arg Asn Leu Ser Thr Ser Thr Glu Pro Val 1160 1165 1170 acc ccg ctc att aac tca cct agt gat tcg ttt ggt gat gat atc 3564 Thr Pro Leu Ile Asn Ser Pro Ser Asp Ser Phe Gly Asp Asp Ile 1175 1180 1185 ctg caa acc aag ctg acc gat ttg acc gtt gag gga gtc atc agc 3609 Leu Gln Thr Lys Leu Thr Asp Leu Thr Val Glu Gly Val Ile Ser 1190 1195 1200 ctg ctg gat cgg att gag gac atg aag cca gcg ttg ccc aaa cta 3654 Leu Leu Asp Arg Ile Glu Asp Met Lys Pro Ala Leu Pro Lys Leu 1205 1210 1215 gcg ccc gtg ctc cgc gag aat gca atc aat gga cgt gtg ctg aag 3699 Ala Pro Val Leu Arg Glu Asn Ala Ile Asn Gly Arg Val Leu Lys 1220 1225 1230 cac tgc gat atg ccg gat ctg aaa tcg gtt ctg ggc ctg agc ttt 3744 His Cys Asp Met Pro Asp Leu Lys Ser Val Leu Gly Leu Ser Phe 1235 1240 1245 ggc cac tgg gag ctg ttc cgc ctg ctg atc acc acg ttg cgg gaa 3789 Gly His Trp Glu Leu Phe Arg Leu Leu Ile Thr Thr Leu Arg Glu 1250 1255 1260 tgc gag cga ttg cca cgg aag caa cag cgt cag cag cag cag ccg 3834 Cys Glu Arg Leu Pro Arg Lys Gln Gln Arg Gln Gln Gln Gln Pro 1265 1270 1275 ggt gcg ttg gag gca cca tcg aat gtc ccg atg att aag gat gtg 3879 Gly Ala Leu Glu Ala Pro Ser Asn Val Pro Met Ile Lys Asp Val 1280 1285 1290 acg gat gcc ctg atg cag cca ccc aga gag tcc ctc tcg cga aag 3924 Thr Asp Ala Leu Met Gln Pro Pro Arg Glu Ser Leu Ser Arg Lys 1295 1300 1305 aac tcc gtc agt cat atg gaa aag cag gtc acg ctg gag gag cag 3969 Asn Ser Val Ser His Met Glu Lys Gln Val Thr Leu Glu Glu Gln 1310 1315 1320 atg atc tgc ggc acc ttg cag act ctg aat gag gag gcg tac gag 4014 Met Ile Cys Gly Thr Leu Gln Thr Leu Asn Glu Glu Ala Tyr Glu 1325 1330 1335 gat gtg gcc agt agc gag cga ccg agt ccc aca ggt gag atg ttg 4059 Asp Val Ala Ser Ser Glu Arg Pro Ser Pro Thr Gly Glu Met Leu 1340 1345 1350 gca gca gtc gca caa ctg caa tta gca ccc atc cgc gag tct tcc 4104 Ala Ala Val Ala Gln Leu Gln Leu Ala Pro Ile Arg Glu Ser Ser 1355 1360 1365 gag ttc gga tcg cct tcc gac gat cag aag cag tat ggg gtc aag 4149 Glu Phe Gly Ser Pro Ser Asp Asp Gln Lys Gln Tyr Gly Val Lys 1370 1375 1380 ata agc aac aac aac aac aac aac cag tac ttg cat gcg gag tac 4194 Ile Ser Asn Asn Asn Asn Asn Asn Gln Tyr Leu His Ala Glu Tyr 1385 1390 1395 aac cgg agc gtc agc tcg cat tcc ctg cag agt ctg agc act ctg 4239 Asn Arg Ser Val Ser Ser His Ser Leu Gln Ser Leu Ser Thr Leu 1400 1405 1410 gtg ggt gca ccc gtt ggt cat ggt ggt tca ggt ggc agt cac cta 4284 Val Gly Ala Pro Val Gly His Gly Gly Ser Gly Gly Ser His Leu 1415 1420 1425 cac ctg ggc aat ggt aac gat ctg agc gat agc acg ctc gat cta 4329 His Leu Gly Asn Gly Asn Asp Leu Ser Asp Ser Thr Leu Asp Leu 1430 1435 1440 atg cat gta gac tcg gta ttt ggt ggt ggt ggt gga ggt ggt tat 4374 Met His Val Asp Ser Val Phe Gly Gly Gly Gly Gly Gly Gly Tyr 1445 1450 1455 cat cgg gca tcg cgc cag atc tcg ata agc agc gag ctg ctg cac 4419 His Arg Ala Ser Arg Gln Ile Ser Ile Ser Ser Glu Leu Leu His 1460 1465 1470 gag tcc agt tcc ttg ccc atg gtg gtg gtc att ccc aat gcc tcg 4464 Glu Ser Ser Ser Leu Pro Met Val Val Val Ile Pro Asn Ala Ser 1475 1480 1485 ggc gaa cag gcc tac gac gac acc aag ctc tga 4497 Gly Glu Gln Ala Tyr Asp Asp Thr Lys Leu 1490 1495 8 1498 PRT Drosophila 8 Met Phe Lys Ser Arg Leu Lys Thr Asn Pro Gly Gly Ser Glu Asn Ile 1 5 10 15 Asn Arg Phe Gly Asp Ser Met Gly Ser Leu Gly His Arg Ala Leu Leu 20 25 30 Gln Tyr Ile Asp Asn Asn Asp Ile Ser Gly Leu Arg Ala Ile Leu Asp 35 40 45 Ser Arg His Leu Thr Ile Asp Asp Arg Asp Glu Asn Ala Thr Thr Val 50 55 60 Leu Met Val Val Ala Gly Arg Gly Leu Thr Ala Phe Val Arg Glu Phe 65 70 75 80 Leu Ala Arg Gly Ala Asp Val Gln Ala Glu Asp Leu Asp Asn Trp Thr 85 90 95 Ala Leu Leu Cys Ala Ser Arg Asn Gly His Leu Asp Val Val Gln Leu 100 105 110 Leu Leu Asp His Gly Ala Glu Val Glu His Arg Asp Met Gly Gly Trp 115 120 125 Thr Ser Leu Met Trp Ala Ala Tyr Arg Gly His Thr Glu Leu Val Arg 130 135 140 Leu Leu Leu Asp Lys Gly Ala Asp Gly Asn Ala His Gly Asn Tyr His 145 150 155 160 Leu Gly Ala Leu Leu Trp Ala Ala Gly Arg Gly Tyr Lys Asp Ile Val 165 170 175 Glu Leu Leu Val Gln Arg Gly Ala Lys Val Asn Val Gly Asp Lys Tyr 180 185 190 Gly Thr Thr Ala Leu Val Trp Ala Cys Arg Arg Gly Asn Val Glu Ile 195 200 205 Val Asp Thr Leu Leu Lys Ala Gly Ala Asn Val Asp Thr Ala Gly Met 210 215 220 Tyr Ser Trp Thr Pro Leu Leu Val Ala Ala Ala Gly Gly His Thr Asp 225 230 235 240 Cys Val Ser Ser Ile Leu Glu Lys Lys Pro Asn Val Asn Ala Leu Asp 245 250 255 Lys Asp Gly Met Thr Ala Leu Cys Ile Ala Ser Arg Glu Gly Phe Gln 260 265 270 Asp Ile Ala Ala Ser Leu Ile Ala Ala Gly Ala Tyr Ile Asn Ile Gln 275 280 285 Asp Arg Gly Ala Asp Thr Pro Leu Ile His Ala Val Lys Ala Gly His 290 295 300 Arg Thr Val Val Glu Ala Leu Leu Lys Lys His Ala Asp Val Asp Ile 305 310 315 320 Gln Gly Lys Asp Arg Lys Thr Ala Ile Tyr Thr Ala Val Glu Lys Gly 325 330 335 His Thr Pro Ile Val Lys Leu Leu Leu Ala Thr Asn Pro Asp Leu Glu 340 345 350 Ser Ala Thr Lys Asp Gly Asp Thr Pro Leu Leu Arg Ala Val Arg Asn 355 360 365 Arg Asn Leu Glu Ile Val His Leu Leu Leu Asp Arg Lys Ala Lys Val 370 375 380 Thr Ala Ser Asp Lys Arg Gly Asp Thr Cys Leu His Ile Ala Met Arg 385 390 395 400 Ala Arg Ser Lys Thr Ile Val Glu Ala Leu Leu Arg Asn Pro Lys His 405 410 415 Ser Gln Leu Leu Tyr Arg Ala Asn Lys Ala Gly Glu Thr Pro Tyr Asn 420 425 430 Ile Asp Ser Leu His Gln Lys Thr Ile Leu Gly Gln Val Phe Gly Ala 435 440 445 Arg Arg Leu Asn Thr Asn Glu Asp Ser Glu Gly Met Leu Gly Tyr Glu 450 455 460 Leu Tyr Ser Ser Ala Leu Ala Asp Val Leu Ser Glu Pro Thr Leu Thr 465 470 475 480 Thr Pro Ile Thr Val Gly Leu Tyr Ala Lys Trp Gly Ser Gly Lys Ser 485 490 495 Phe Leu Leu Asn Lys Leu Arg Asp Glu Met Asn Asn Phe Ala Arg Gln 500 505 510 Trp Ala Glu Pro Pro Ile Arg Thr Ser Gly Leu Leu Phe Ile Val Cys 515 520 525 Leu His Val Ala Leu Leu Ile Gly Thr Ile Val Gly Leu Ser Thr Trp 530 535 540 Ser Ala Val Val Gly Val Ser Ala Ala Val Gly Phe Leu Leu Leu Ala 545 550 555 560 Tyr Leu Leu Leu Ala Ala Val Arg Tyr Cys Asn Tyr Gln Met Asp Met 565 570 575 Gln Trp Ala Tyr Ser Val Gln His Gly Leu Glu Lys Arg Met Thr Arg 580 585 590 Leu Arg Leu Ile Leu Gln Val Ala Phe Cys His Pro Pro Gly Pro Gln 595 600 605 Ser Asp Ser Gln Ala Lys Pro Val Arg Phe His Phe Ala Glu Ala Asn 610 615 620 Ser Ala Ser Pro Thr Gly Asp Gly Ala Val Ala His Met Leu Ala Ala 625 630 635 640 Leu Leu Asp Ala Ile Glu Ser His Tyr Gly Trp Leu Ala Thr Arg Leu 645 650 655 Tyr Arg Ala Phe Arg Pro Lys Cys Leu Lys Val Asp Val Gly Trp Arg 660 665 670 Trp Arg Arg Met Cys Cys Ile Pro Ile Val Leu Ile Phe Glu Leu Ala 675 680 685 Leu Val Thr Val Val Thr Gly Ile Ser Leu Thr Val Ala Tyr Phe Thr 690 695 700 Phe Ala Asp Glu Lys Glu Lys Glu His Ile Leu Val Ala Leu Tyr Val 705 710 715 720 Ile Ala Ala Val Met Gly Thr Leu Ile Cys Thr His Leu His Val Leu 725 730 735 Ala Lys Val Phe Val Ser Leu Phe Thr Ser His Ile Arg Val Leu Lys 740 745 750 Arg Ala Val Arg Ser Ser Glu Ser Ala Pro Leu Thr Met Leu Gly Ala 755 760 765 Glu Val Ala Val Met Thr Asp Met Val Lys Cys Leu Asp Ala Phe Thr 770 775 780 Asn Gln Gln Ser Arg Leu Val Gly Val Ile Asp Ala Leu Asp Ser Cys 785 790 795 800 Asp Thr Glu Arg Ile Leu Thr Leu Leu Asn Ala Val Gln Thr Leu Leu 805 810 815 Ser Ser Pro Asn Arg Pro Phe Val Leu Leu Ile Ser Val Asp Pro His 820 825 830 Val Ile Ala Lys Ala Ala Glu Ala Asn Ser Arg Arg Leu Phe Thr Glu 835 840 845 Gly Gly Ile Gly Gly His Asp Phe Leu Arg Asn Leu Val His Leu Pro 850 855 860 Val Tyr Leu Gln Asn Ser Gly Leu Arg Lys Val Gln Arg Ala Gln Met 865 870 875 880 Thr Ala Leu Leu Phe Lys Arg Ser Gly Gly Gly Asp Tyr Gln Thr Asp 885 890 895 Asp Gly Pro Thr Leu Gly His Ser Val Ser Ala Arg Arg Leu Ser Asn 900 905 910 Ala Ser Glu Ile Ile Ser Ser Gln Glu Lys Leu Arg Gly Pro Ala Arg 915 920 925 Gly Gly Gly Gly Lys Lys Leu Arg Leu Ser Glu Ser Val Ala Ser Ser 930 935 940 Thr Gly Ser Asn Leu His Arg Leu Gly Gln Asn Pro Gln Thr Val Leu 945 950 955 960 Asp Leu Ser Arg Ile Val Leu Thr Asp Asp Tyr Phe Ser Asp Val Asn 965 970 975 Pro Arg Ser Met Arg Arg Leu Met Asn Val Ile Tyr Ile Thr Val Arg 980 985 990 Leu Leu Lys Ala Phe Gln Ile Glu Phe Ser Trp Tyr Arg Leu Ser Ser 995 1000 1005 Trp Ile Asn Leu Thr Glu Gln Trp Pro Leu Arg Ala Ser Met Ile 1010 1015 1020 Val Leu His His Asp Gln Phe Met Asp Ser Asn Ala Asp Glu Ser 1025 1030 1035 Val Ser Leu Gln Ser Val Tyr Glu Lys Leu Arg Pro Lys Leu Ala 1040 1045 1050 Tyr Leu Arg Glu Ala Ala Pro Leu Leu Glu Leu Asp Arg Asp Glu 1055 1060 1065 Arg Lys Leu Asp Ala Phe Leu Gln Leu His Lys Ser Asp Leu Leu 1070 1075 1080 Val Ala Asp Leu Arg Ile Phe Leu Pro Phe Thr Ile Asn Leu Asp 1085 1090 1095 Pro Tyr Leu Arg Lys Val Leu Lys Glu Asp Gln Gln Thr Ile Glu 1100 1105 1110 Asp Glu Gly Ser Leu Val Ile Gln Ala Arg Pro Ser Val Ser Asn 1115 1120 1125 Thr Met Arg Gln Phe Pro Ala Pro Thr Thr Tyr Val Pro Ser Pro 1130 1135 1140 Gln Ala Tyr Pro Pro Tyr Gln Met Phe Gln Asn Glu Tyr Pro Ala 1145 1150 1155 Asn Glu Leu Arg Ser Arg Asn Leu Ser Thr Ser Thr Glu Pro Val 1160 1165 1170 Thr Pro Leu Ile Asn Ser Pro Ser Asp Ser Phe Gly Asp Asp Ile 1175 1180 1185 Leu Gln Thr Lys Leu Thr Asp Leu Thr Val Glu Gly Val Ile Ser 1190 1195 1200 Leu Leu Asp Arg Ile Glu Asp Met Lys Pro Ala Leu Pro Lys Leu 1205 1210 1215 Ala Pro Val Leu Arg Glu Asn Ala Ile Asn Gly Arg Val Leu Lys 1220 1225 1230 His Cys Asp Met Pro Asp Leu Lys Ser Val Leu Gly Leu Ser Phe 1235 1240 1245 Gly His Trp Glu Leu Phe Arg Leu Leu Ile Thr Thr Leu Arg Glu 1250 1255 1260 Cys Glu Arg Leu Pro Arg Lys Gln Gln Arg Gln Gln Gln Gln Pro 1265 1270 1275 Gly Ala Leu Glu Ala Pro Ser Asn Val Pro Met Ile Lys Asp Val 1280 1285 1290 Thr Asp Ala Leu Met Gln Pro Pro Arg Glu Ser Leu Ser Arg Lys 1295 1300 1305 Asn Ser Val Ser His Met Glu Lys Gln Val Thr Leu Glu Glu Gln 1310 1315 1320 Met Ile Cys Gly Thr Leu Gln Thr Leu Asn Glu Glu Ala Tyr Glu 1325 1330 1335 Asp Val Ala Ser Ser Glu Arg Pro Ser Pro Thr Gly Glu Met Leu 1340 1345 1350 Ala Ala Val Ala Gln Leu Gln Leu Ala Pro Ile Arg Glu Ser Ser 1355 1360 1365 Glu Phe Gly Ser Pro Ser Asp Asp Gln Lys Gln Tyr Gly Val Lys 1370 1375 1380 Ile Ser Asn Asn Asn Asn Asn Asn Gln Tyr Leu His Ala Glu Tyr 1385 1390 1395 Asn Arg Ser Val Ser Ser His Ser Leu Gln Ser Leu Ser Thr Leu 1400 1405 1410 Val Gly Ala Pro Val Gly His Gly Gly Ser Gly Gly Ser His Leu 1415 1420 1425 His Leu Gly Asn Gly Asn Asp Leu Ser Asp Ser Thr Leu Asp Leu 1430 1435 1440 Met His Val Asp Ser Val Phe Gly Gly Gly Gly Gly Gly Gly Tyr 1445 1450 1455 His Arg Ala Ser Arg Gln Ile Ser Ile Ser Ser Glu Leu Leu His 1460 1465 1470 Glu Ser Ser Ser Leu Pro Met Val Val Val Ile Pro Asn Ala Ser 1475 1480 1485 Gly Glu Gln Ala Tyr Asp Asp Thr Lys Leu 1490 1495 

What is claimed is:
 1. An isolated polypeptide, which associates with TrkA and p75 neurotrophin receptors, is a target for phosphorylation by neurotrophin and ephrin receptor tyrosine kinases, enhances neurotransmitter release, and modulates the clustering of proteins involved in ion channel formation, comprising the amino acid sequence of: (A) SEQ ID NO:2; (B) SEQ ID NO:4; (C) a fragment of the polypeptide of SEQ ID NO:2; (D) a fragment of the polypeptide of SEQ ID NO:4; (E) a variant polypeptide which is at least 95% identical to SEQ ID NO:2; (F) a variant polypeptide which is at least 95% identical to SEQ ID NO:4; or (G) a functional derivative or a salt of (A), (B), (C), (D), (E), or (F) wherein said fragments (C) and (D) and said variants (E) and (F) have the properties of associating with TrkA and p75 neurotrophin receptors, being a target for phosphorylation by neurotrophin and ephrin receptor tyrosine kinase, enhancing neurotransmitter release, and modulating the clustering of proteins involved in ion channel formation.
 2. The polypeptide of claim 1, which comprises the amino acid sequence of SEQ ID NO:2.
 3. The polypeptide of claim 1, which comprises the amino acid sequence of SEQ ID NO:4.
 4. The polypeptide of claim 1, which comprises the amino acid sequence of fragment (C).
 5. The polypeptide of claim 4, wherein said fragment (C) either further contains one or more transmembrane domains of the polypeptide of SEQ ID NO:2 or is fused to a transmembrane domain to form a fusion polypeptide.
 6. The polypeptide of claim 1, which comprises the amino acid sequence of fragment (D).
 7. The polypeptide of claim 6, wherein said fragment (D) either further contains one or more transmembrane domain of the polypeptide of SEQ ID NO:4 or is fused to a transmembrane domain to form a fusion polypeptide.
 8. The polypeptide of claim 1, which comprises the amino acid sequence of a variant polypeptide which is at least 95% identical to SEQ ID NO:2.
 9. The polypeptide of claim 1, which comprises the amino acid sequence of a variant polypeptide which is at least 95% identical to SEQ ID NO:4.
 10. A molecule which contains the antigen-binding portion of an antibody specific for the polypeptide of claim
 1. 11. The molecule of claim 10, which is a monoclonal antibody.
 12. The molecule of claim 10, which is a single chain antibody.
 13. The molecule of claim 10, which is specific for a polypeptide of SEQ ID NO:2.
 14. A method for visualizing the growth cone of neurons, comprising: contacting the molecule of claim 13 with neurons to detect the presence of the polypeptide of SEQ ID NO:2 as a marker for the growth cone of neurons; and visualizing the growth cone of neurons by the binding of the molecule of claim 13 to the polypeptide of SEQ ID NO:2 localized in the growth cone of neurons.
 15. The molecule of claim 10, which is specific for a polypeptide of SEQ ID NO:4.
 16. A method for visualizing the growth cone of neurons, comprising: contacting the molecule of claim 15 with neurons to detect the presence of the polypeptide of SEQ ID NO:4 as a marker for the growth cone of neurons; and visualizing the growth cone of neurons by the binding of the molecule of claim 15 to the polypeptide of SEQ ID NO:4 localized in the growth cone of neurons.
 17. An isolated nucleic acid encoding the polypeptide of claim
 1. 18. The nucleic acid of claim 17, which encodes a polypeptide comprising SEQ ID NO:2.
 19. The nucleic acid of claim 18, comprising a nucleotide sequence of SEQ ID NO:1.
 20. The nucleic acid of claim 17, which encodes a polypeptide comprising SEQ ID NO:4.
 21. The nucleic acid of claim 20, comprising a nucleotide sequence of SEQ ID NO:3.
 22. The nucleic acid of claim 17, which encodes the amino acid sequence of fragment (C).
 23. The nucleic acid of claim 17, which encodes the amino acid sequence of fragment (D).
 24. The nucleic acid of claim 17, which encodes a variant polypeptide which is at least 95% identical to SEQ ID NO:2.
 25. The nucleic acid of claim 17, which encodes a variant polypeptide which is at least 95% identical to SEQ ID NO:4.
 26. A vector comprising the nucleic acid of claim
 17. 27. A host cell transformed with the nucleic acid of claim
 17. 28. A method for producing a polypeptide which associates with TrkA and p75 neurotrophin receptors, is a target for phosphorylation by neurotrophin and ephrin receptor tyrosine kinases, enhances neurotransmitter release, and modulates the clustering of proteins involved in ion channel formation, comprising: culturing the host cell of claim 27 in a nutrient medium to express and produce the polypeptide; and recovering the produced polypeptide.
 29. An isolated nucleic acid, which specifically hybridizes under stringent conditions to the complement of either SEQ ID NO:1 or SEQ ID NO:3.
 30. The nucleic acid of claim 29, which specifically hybridizes under highly stringent conditions to the complement of SEQ ID NO:1.
 31. The nucleic acid of claim 29, which specifically hybridizes under hybridizes under highly stringent conditions to the complement of SEQ ID NO:3.
 32. A vector comprising the nucleic acid of claim
 29. 33. A host cell transformed with the nucleic acid of claim
 29. 34. A method for producing a polypeptide which associates with TrkA and p75 neurotrophin receptors, is a target for phosphorylation by neurotrophin and ephrin receptor tyrosine kinases enhances neurotransmitter release, and modulates the clustering of proteins involved in ion channel formation, comprising: culturing the host cell of claim 33 in a nutrient medium to express and produce the polypeptide; and recovering the produced polypeptide. 